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EXECUTIVE SUMMARY 

The University of Colorado's Laboratory for Atmosphere and Space 
Physics in Boulder, Colorado was the site of the second workshop of 
the Applied Information Systems Research Program (AISRP) . The 
workshop was sponsored by the Information Systems Branch of NASA's 
Office of Space Science and Applications (OSSA) . The purpose of 
this year's workshop was to evaluate progress and discern topics 
for new AISRP research to address OSSA science division needs. To 
this end, AISRP investigators presented their progress to date and 
future direction. OSSA Information Systems Management Board 
science division members or their representatives participated in 
a panel discussion of the scientific needs and problem areas in the 
process of doing their discipline's science. 

The Earth and space science participants were able to see where the 
current research can be applied in their disciplines and computer 
science participants could see potential areas for future 
application of computer and information systems research. The 
Earth and Space Science research proposals for the High Performance 
Computing and Communications (HPCC) program were under evaluation. 
Therefore, this effort was not discussed at the AISRP Workshop. 
OSSA's other high priority area in computer science is scientific 
visualization, with the entire second day of the workshop devoted 
to it. 

Many of the AISRP investigations are intended for multi- 
disciplinary application to the Earth and space sciences. 
Therefore, the presentations were organized into sessions according 
to computer science disciplines. Joe Bredekamp of NASA/OSSA, and 
the program's sponsor, opened the workshop with an overall context 
for the program. Dave Thompson of the Ames Research Center chaired 
the Tuesday morning session on artificial intelligence and related 
areas of expert systems and neural networks. Randal Davis, of the 
University of Colorado and our host, chaired the Tuesday afternoon 
session devoted to data related research, including data 
compression, data archiving, data access, and data analysis. The 
entire day on Wednesday was devoted to the scientific visualization 
session chaired by Mike Botts of the University of Alabama in 
Huntsville. Glenn Mucklow, OSSA's Program Manager for Information 
Systems Research and Technology for Joe Bredekamp, chaired the 
workshop and Thursday's session addressing programmatic issues. 
Thursday included the science discipline panel and a panel on 
closer collaboration and cooperation between the Office of 
Aeronautics and Space Technology (OAST) and OSSA research and 
development activities with application in the Earth and space 
sciences. The closing discussions included plans for new research 
announcements in FY 1993 from the AISRP and the Center of 
Excellence in Space Data and Information Sciences. 
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The workshop addressed issues raised at last year's workshop in 
technology transfer across disciplines and to the broader 
scientific community. Action was taken to establish a software 
support laboratory at the University of Colorado under the 
direction of Randal Davis. This group will provide a minimalist 
capability for testbeds and demonstrations, software tool 
distribution, capturing user experience, and the development of 
data test suites. In addition, progress in other areas identified 
at the last meeting were covered. The progress reported in the 
area of data formats resulted in a splinter group discussion as did 
the areas of data compression and future plans for the Internet. 
Agreements on collaboration were made between Dr. Hansen and Dr. 
Jacobson, and Dr. Emery and Dr. Kinter. OAST agreed to provide 
some funding for AI investigators to work with the OSSA Pi's. 


Glenn H. Mucklow 
Program Manager 

Information Systems Research & Technology 
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An Interactive Environment for the Analysis of 
Large Earth Observation and Model Data Sets 



Principal Investigator: Assistant Professor Kenneth P. Bowman 

University of Illinois 

Co-Investigators: Professor John E. Walsh 

University of Illinois 

Professor Robert B. Wilhelmson 
University of Illinois 


Summary: 

We propose to develop an interactive environment for the analysis 
of large Earth science observation and model data sets. We will 
use a standard scientific data storage format and a large 
capacity (>20 GB) optical disk system for data management; 
develop libraries for coordinate transformation and regridding of 
data sets; modify the NCSA X Image and X DataSlice software for 
typical Earth observation data sets by including map 
transformations and missing data handling; develop analysis tools 
for common mathematical and statistical operations; integrate the 
components described above into a system for the analysis and 
comparison of observations and model results; and distribute 
software and documentation to the scientific community. 
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Interactive Interface for National Center 
for Atmospheric Research (NCAR) Graphics 


Principal Investigator: Dr. William Buzbee 

National Center for Atmospheric Research 

Co-Investigators: Robert L. Lackman 

National Center for Atmospheric Research 


Summary: 

NCAR Graphics is a FORTRAN 77 library of over 30 high-level 
graphics modules which are heavily used by science and 
engineering researchers at over 1500 sites world-wide including 
many universities and government agencies. These Earth science 
oriented modules now have a FORTRAN callable subroutine interface 
which excludes their use by non-programming researchers. This 
proposal outlines the development of a fully interactive "point 
and click" menu-based interface using the prevailing toolkit 
standard for the X-Window System. Options for direct output to 
the display window and/or output to a Computer Graphics Metafile 
(CGM) will be provided. X, PEX, and PHIGS will be implemented as 
the underlying windowing and graphics standards. Associated 
meteorological and geometric data sets would exploit the network 
extended NASA Common Data Format, netCDF. 
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Development of a Tool-Set for Simultaneous, 

Multi-Site Observations of Astronomical Objects 
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Principal Investigator: Dr. Supriya Chakrabarti 

University of California 

Co-Investigators: Dr. J. Garrett Jernigan 

University of California, Berkeley 

Dr. Herman L. Marshall 
University of California, Berkeley 


Summary: 

A network of ground and space based telescopes can provide 
continuous observation of astronomical objects. In a "Target of 
Opportunity" scenario triggered by the system, any telescope on 
the network may request supporting observations. We propose to 
develop a set of data collection and display tools to support 
these observations. We plan to demonstrate the usefulness of 
this toolset for simultaneous multi-site observations of 
astronomical targets. Possible candidates for the proposed 
demonstration include the Extreme Ultraviolet Explorer, 
International Ultraviolet Explorer, ALEXIS, and sounding rocket 
experiments. Ground based observations operated by the 
University of California, Berkeley; the Jet Propulsion 
Laboratory? and Fairborn Observatory, Mesa, Arizona will be used 
to demonstrate the proposed concept. Although the demonstration 
will involve astronomical investigations, these tools will be 
applicable to a large number of scientific disciplines. The 
software tools and systems developed as a result of our work will 
be made available to the scientific community. 
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l'"’ Multivariate Statistical Analysis Software Technologies for 
Astrophysical Research Involving Large Data Bases 


Principal Investigator: Professor Stanislav Djorgovski 

California Institute of Technology 


Summary: 

The existing and forthcoming data bases from NASA missions 
contain an abundance of information whose complexity cannot be 
efficiently tapped with simple statistical techniques. Powerful 
multivariate statistical methods already exist which can be used 
to harness much of the richness of these data. Automatic 
classification techniques have been developed to solve the 
problem of identifying known types of objects in multiparameter 
data sets, in addition to leading to the discovery of new 
physical phenomena and classes of objects. We propose an 
exploratory study and integration of promising techniques in the 
development of a general and modular classification/analysis 
system for very large data bases, which would enhance and 
optimize data management and the use of human research resources. 
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A Land-Surface Testbed for EOSDIS 

Investigator: Dr. William Emery 

University of Colorado 


Co-Investigators : 


Dr. Jeff Dozier 

University of California, Santa Barbara 


Paul ROtar 

National Center for Atmospheric Research 


Summary: 

We propose to develop an on-line data distribution and 
interactive display system for the collection, archival, 
distribution and analysis of operational weather satellite data 
for applications in land surface studies. A 1,000 km2 scene of 
the western U.S. (centered on the Colorado Rockies) will be 
extracted from Advanced Very High Resolution Radiometer (AVHRR) 
imagery collected from morning and afternoon passes of the NOAA 
polar-orbiters at the direct readout stations operated by 
CU/CCAR. All five channels of these AVHRR data will be navigated 
and map registered at CU/CCAR and then be transferred to NCAR for 
storage in an on-line data system. Software will also be 
available at NCAR to process and navigate the raw AVHRR data as 
needed. A display workstation software, based on a Macintosh II 
computer, will be developed that will display and further process 
the AVHRR data for studies of vegetation monitoring and snowpack 
assessment. Various options of presently used techniques for 
both vegetation and snowpack monitoring will be implemented in 
the workstation software to provide the individual investigator 
with the freedom to interact with the satellite image data. The 
display software will be freely distributed online to interested 
investigators and the AVHRR data will be made available on-line 
to anyone interested. In addition, potential users will be 
sought out and connected to the on-line data archive. This 
experiment with an active on-line archive and interactive 
analysis systems will provide experience with a small scale 
EOSDIS. 
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Multivariate Statistical Analysis Software Technologies for 
Astrophysical Research Involving Large Data Bases 


Principal Investigator: Professor Stanislav Djorgovski 

California Institute of Technology 

BIBLIOGRAPHY 


Papers based in part on the work performed under the contract NAS5-31348 so far: 

1. Applications of the Multivariate Statistical Analysis Package: 

“Systematic Differences Between the Field and Cluster Ellipticals”, de Carvalho, FL, and 
Djorgovski, S. 1992, Astrophys. J . Letters 389, L49. 

“On the Formation of Globular Clusters in Elliptical Galaxies”, Djorgovski, S., and Santiago, 
B.X. 1992, Astrophys. J . Letters 391, LS5. 

“Multivariate Analysis of Globular Cluster Systems in Early-Type Galaxies”, Santiago, B.X., 
and Djorgovski, S. 1992, M.N.R.A.S. in press. 

“Systematics of Galaxy Properties: Clues About Their Formation”, Djorgovski, S. 1992, in 
R. de Carvalho (ed.), Cosmolog y and Large-Scale Structure in the Universe , A.S.P. 
Con f. Ser. in press. 

“Dynamical Evolution Effects on the Hot Stellar Populations in Globular Clusters”, Djor- 
govski, S., and Piotto. G. 1992. submitted to the Astron. J . 

“Towards the Solution of the Second Parameter Problem: the Effects of Cluster Density 
and Concentration the Horizontal Branch Morphology”, Fusi Pecci, F., Ferraro, F., 
Bellazzini, M., Djorgovski, S., Piotto, G., and Buonanno, R. 1992, submitted to the 
Astron , J. 

“What Determines the Stellar Mass Functions in Globular Clusters?”, Djorgovski, S., Piotto, 
G., and Capaccioli, M. 1992, submitted to the Astron. J. 

2. Development of the Automatic Classification and Sky Survey Analysis Tools: 

“Towards a Digitized Second Palomar Sky Survey: Initial Reduction and Star/Galaxy Clas- 
sification ”, Weir, N., Djorgovski, S.. Fayyad, U., and Doyle, R. 1991, Bull Am. Astron. 
Soc . 23, 1434. 

“Applying Machine Learning Classification Techniques to Automate Sky Object Cataloguing” 
Fayyad, U M Doyle, R., Weir, N., and Djorgovski, S. 1992, to appear in: Proceedings of 
the International Space Year Conference on Earth Us Space Science Information Systems , 
Pasadena, CA, February 1992 . 

“Automating Sky Object Classification in Astronomical Survey Images”, Fayyad, U., Doyle, 
R., Weir, N m and Djorgovski, S. 1992, to appear in: Proceedings of the Machine Discov- 
ery Workshop, Ninth International Conference on Machine Learning, Aberdeen, Scot- 
land, July 1992. 

“The Palomar Observatory - STScI Digital Sky Survey: I. Program Definition and Status”, 
Djorgovski, S., Lasker, B., Weir, N., Postman, M., Reid, I.N., and Laidler, V. 1992, 
Bull Am. Astron. Soc. 24, 750. 

“The Palomar Observatory - STScI Digital Sky Survey: II. The Scanning Proces”, Lasker, 
B M Djorgovski, S., Postman, M,, Laidler, V., Weir, N., Reid, I.N., and Sturch, C. 1992, 
Bull Am. Astron. Soc. 24, 741. 

“An Analysis of the Palomar Observatory - STScI Digital Sky Survey: Catalog Construction 
and Initial Results”, Weir, N., Djorgovski, S., Fayyad, U-, and Doyle, R. 1992, Bull 
Am. Astron. Soc. 24, 741. 


Additional papers are now in preparation. 
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Geographic Information System for Fusion and Analysis of 
High-Resolution Remote Sensing and Ground Truth 

» - f 

Principal Investigator: Anthony Freeman 

Jet Propulsion Laboratory 

Co-Investigators: Jo Bea Way 

Jet Propulsion Laboratory 

Pascale Du Bois 
Jet Propulsion Laboratory 

Franz Leberl 
VEXCEL Corporation 


Summary: 

We seek to combine high-resolution remotely sensed data with 
models and ground truth measurements, in the context of a 
Geographical Information System, integrated with specialized 
image processing software. We will use this integrated system to 
analyze the data from two Case Studies, one at a boreal forest 
site, the other a tropical forest site. We will assess the 
information content of the different components of the data, 

the optimum data combinations to study biogeophysical 
changes in the forest, assess the best way to visualize the 
results, and validate the models for the forest response to 
different radar wavelengths/polarizations. 

During the 1990 's, unprecedented amounts of high-resolution 
images from space of the Earth's surface will become available to 
the applications scientist from the LANDSAT/TM series, European 
and Japanese ERS-1 satellites, RADARSAT and SIR-C missions. When 
the Earth Observation Systems (EOS) program is operational, the 
amount of data available for a particular site can only increase. 
The interdisciplinary scientist, seeking to use data from various 
sensors to study his site of interest, may be faced with massive 
, culties in manipulating such large data sets, assessing 
^- n ^ orina ^^ on content, determining the optimum combinations 
of data to study a particular parameter, visualizing his results 
and validating his model of the surface. The techniques to deal 
with these problems are also needed to support the analysis of 
data from NASA's current program of Multi-sensor Airborne 
Campaigns, which will also generate large volumes of data. 

the Case Studies outlined in this proposal, we will have 
somewhat unique data sets. For the Bonanza Creek Experimental 
Forest (Case I) calibrated DC-8 SAR data and extensive ground 
truth measurement are already at our disposal. The data set 
shows documented evidence to temporal change. The Belize Forest 
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Experiment (Case II) will produce calibrated DC-8 SAR and AVIRIS 
data, together with extensive measurements on the tropical rain 
forest itself. The extreme range of these sites, one an Arctic 
forest, the other a tropical rain forest, has been deliberately 
chosen to find common problems which can lead to generalized 
observations and unique problems with data which raise issues for 
the EOS System. 
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Construction of an Advanced Software Tool /) ^ 2 — 

for Planetary Atmospheric Modeling ' 


Principal Investigator: Dr. Peter Friedland 

Ames Research Center 


Co-Investigators: Dr. Richard M. Keller 

Ames Research Center 

Dr. Christopher P. McKay 
Ames Research Center 

Michael H. Sims 
Ames Research Center 

Dr. David E. Thompson 
Ames Research Center 


Summary: 

Scientific model-building can be a time intensive and 
painstaking process, often involving the development of large 
complex computer programs. Despite the effort involved, 
scientific models cannot be distributed easily and shared with 
other scientists. In general, implemented scientific models are 
complicated, idiosyncratic, and difficult for anyone but the 
original scientist/programmer to understand. We propose to 
construct a scientific modeling software tool that serves as an 
aid to the scientist in developing, using and sharing models. 

The proposed tool will include an interactive intelligent 
graphical interface and a high-level domain-specific modeling 
language. As a testbed for this research, we propose to develop 
a software prototype in the domain of planetary atmospheric 
modeling. 
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Construction of an Advanced Software Tool 
for Planetary Atmospheric Modeling 


Principal Investigator: Dr. Peter Friedland 

Ames Research Center 

Co-Investigators: Dr. Richard M. Keller 

Ames Research Center 

Dr. Christopher P. McKay 
Ames Research Center 

Michael H. Sims 
Ames Research Center 

Dr. David E. Thompson 
Ames Research Center 

BIBLIOGRAPHY 


Richard M. Keller and Michal Rimon, "A Knowledge-based Software Development 
Environment for Scientific Model-building", 7th Knowledae- Based Software 
Engineering Conference (KBSE-92), Tyson s Corner, VA, September 1 992. 

Richard M. Keller, "Artificial Intelligence Support for Scientific Model-building", Proc. 
AAAI Fall Symposium on Intelligent Scientific Computation, Boston, MA, October 
1992. - =>- - ; r::;: - 

Richard M. Keller, "Knowledge-intensive Software Design: Can too much knowledge 
be a burden?", Proc. AAAI-92 Workshop on Automating Software Design, San 
Jose, CA, July 1992. 

J. L. Dungan and R. Keller, "Development of an Advanced Software Tool for 
Ecosystem Simulation Modelling", Abstracts supplement of the Bulletin of the 
Ecological Society of America , 72(2) p.1 04, 1991 . 

Richard M Keller, "The Scientific Modeling Assistant: An Interactive Scientific 
Model-Building Tool", Proc. AAAI-91 Workshop on Automating Software Design, 
Anaheim, CA, July 1991. 

R.M. Keller, M.H. Sims, E. Podolak, and C.P. McKay, "Constructing an Advanced 
Software Tool for Planetary Atmospheric Modeling", Proc. i-SAIRAS'90 
(International Symposium on Artificial Intelligence, Robotics and Automation in 
Space), Kobe, Japan, November 1990. 
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System of Experts for Intelligent Data Management 


(SEIDAM) 

?-! 


Principal Investigator: Dr. David G. Goodenough 

Canada Centre for Remote Sensing (CCRS) 


Co-Investigators : 


Joji Iisaka 

Canada Centre for Remote Sensing 


Ko Fung 

University of Ottawa 


Summary: 

It is proposed to conduct research and development on a system of 
expert systems for intelligent data management (SEIDAM) . CCRS 
has much expertise in developing systems for integrating 
geographic information with space and aircraft remote sensing 
data and in managing large archives of remotely sensed data. 
SEIDAM will be composed of expert systems grouped in three 
levels. At the lowest level, the expert systems will manage and 
integrate data from diverse sources, taking account of symbolic 
representation differences and varying accuracies. Existing 
software can be controlled by these expert systems, without 
rewriting existing software into an Artificial Intelligence (AI) 
language. At the second level, SEIDAM will take the interpreted 
data (symbolic and numerical) and combine these with data models. 
At the top level, SEIDAM will respond to user goals for 
predictive outcomes given existing data. The SEIDAM Project will 
address the research areas of expert systems, data management, 
storage and retrieval, and user access and interfaces. 
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Dr. Richard J. Doyle 
Jet Propulsion Laboratory 

Dr. Donald J. Collins 
Jet Propulsion Laboratory 


Summary : 

Within this decade, the growth in complexity of exploratory 
data analysis and the sheer volume of space data require new and 
innovative approaches to support science investigators in 
achieving their research objectives. To date, there have been 
numerous efforts addressing the individual issues involved in 
inter-disciplinary, multi-instrument investigations. However, 
while successful in small scale, these efforts have not proven to 
be open and scaleable. 

This proposal addresses four areas of significant needi- - 
scientific visualization and analysis; science data management; 
interactions in a distributed, heterogeneous enviro nment ; and 
knowledge-based assistance for these functions. The fundamental 
innovation embedded within this proposal is the integration of 
three automation technologies, namely, knowledge-based expert 
systems, science visualization and science data management. This 
integration is based on the concept called the DataHub. With the 
DataHub concept, NASA will be able to apply a more complete 
solution to all nodes of a distributed system. Both computation 
nodes and interactive nodes will be able to effectively and 
efficiently use the data services (access, retrieval, update, 
etc.) with a distributed, interdisciplinary information system in 
a uniform and standard way. This will allow the science 
investigators to concentrate on their scientific endeavors, 
rather than to involve themselves in the intricate technical 
details of the systems and tools required to accomplish their 
work. Thus, science investigators need not be programmers. The 
emphasis will be on the definition and prototyping of system 
elements with sufficient detail to enable data analysis and 
interpretation leading to publishable scientific results. In 
addition, the proposed work includes all the required end-to-end 
components and interfaces to demonstrate the completed concept. 
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Experimenter * s Laboratory for Visualized Interactive Science 


Principal Investigator: Elaine R. Hansen 

University of Colorado at Boulder 

Co-Investigators: Marjorie K. Klemp 

University of Colorado at Boulder 

Sally W. Lasater 

University of Colorado at Boulder 

Marti R. Szczur 

Goddard Space Flight Center 

Joseph B. Klemp 

National Center for Atmospheric Research 


Summary : 

The science activities of the 1990's will require the 
analysis of complex phenomena and large diverse sets of data. In 
order to meet these needs, we must take advantage of advanced 
user interaction techniques: modern user interface tools; 
visualization capabilities; affordable, high performance graphics 
workstations; and interoperable data standards and translator. To 
meet these needs, we propose to adopt and upgrade several 
existing tools and systems to create an experimenter's laboratory 
for visualized interactive science. Intuitive human-computer 
interaction techniques have already been developed and 
demonstrated at the University of Colorado. A Transportable 
Applications Executive (TAE+) , developed at GSFC, is a powerful 
user interface tool for general purpose applications. A 3D 
visualization package developed by NCAR provides both color- 
shaded surface displays and volumetric rendering in either index 
or true color. The Network Common Data Form (NetCDF) data access 
library developed by Unidata supports creation, access and 
sharing of scientific data in a form that is self-describing and 
network transparent. The combination and enhancement of these 
packages constitutes a powerful experimenter's laboratory capable 
of meeting key science needs of the 1990 's. This proposal 
encompasses the work required to build and demonstrate this 
capability. 
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Topography from Shading and Stereo 
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Principal Investigator: Professor Berthold P. Horn 

Massachusetts Institute of Technology 

Co-Investigators: Michael Caplinger 

Arizona State University 


Summary: 

Methods exploiting photometric information in images that 
have been developed in machine vision can be applied to planetary 
imagery. Present techniques, however, focus on one visual cue, 
such as shading or binocular stereo, and produce results that are 
either not very accurate in an absolute sense or provide 
information only at few points on the surface. We plan to 
integrate shape from shading, binocular stereo and photometric 
stereo to yield a robust system for recovering detailed surface 
shape and surface reflectance information. Such a system will be 
useful in producing quantitative information from the vast volume 
of imagery being received, as well as in helping visu a liz e the 
underlying surface. The work will be carried out on a popular 
computing platform so that it will be easily accessible to other 
workers . 
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A Distributed System for Visualizing and Analyzing 
Multivariate and Multidisciplinary Data 


Principal Investigator: Dr. Allan S. Jacobson 

Jet Propulsion Laboratory 


Co- Investigators: Dr. Mark Allen 

Dr. Michael Bailey 
Dr. Ronald Blom 
Leo Blume 
Dr. Lee Elson 

[all from Jet Propulsion Laboratory] 


Summary: 

The Linked Windows Interactive Data System (LinkWinds) is being 
developed with NASA support. The objective of this proposal is 
to adapt and apply that system in a complex network environment 
containing elements to be found by scientists working 
multidisciplinary teams on very large scale and distributed data 
sets. The proposed three year program will develop specific 
visualization and analysis tools, to be exercised locally and 
remotely in the LinkWinds environment, to demonstrate visual data 
analysis, interdisciplinary data analysis and cooperative and 
interactive televisualization and analysis of data by 
geographically separated science teams. These demonstrations 
will involve at least two science disciplines with the aim of 
producing publishable results. 
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The Grid Analysis and Display System (GRADS) : 

A Practical Tool for Earth Science Visualization 


Principal Investigator: Dr. James L. Kinter, III 

University of Maryland 


Summary: 

We propose to develop and enhance a workstation based grid 
analysis and display software system for Earth science dataset 
browsing, sampling and manipulation. The system will be coupled 
to a supercomputer in a distributed computing environment for 
near real-time interaction between scientists and computational 
results . 
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Planetary Data Analysis and Display System: 
A Version of PC-McIDAS 
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Principal Investigator: Dr. San jay S. Limaye 

University of Wisconsin-Madison 

Co-Investigators: L. A. Sromovsky 

University of Wisconsin-Madison 

R. S. Saunders 

Jet Propulsion Laboratory 

Michael Martin 

Jet Propulsion Laboratory 


Summary: 

We propose to develop a system for access and analysis of 
planetary data from past and future space missions based on an 
existing system, the PC-McIDAS workstation. This system is now 
in use in the atmospheric science community for access to 
meteorological satellite and conventional weather data. The 
proposed system would be usable by not only planetary atmospheric 
researchers but also by the planetary geologic community. By 
providing the critical tools of an efficient system architecture, 
newer applications and customized user interfaces can be added by 
the end user within such a system. 
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Multi-Layer Holographic Bifurcative Neural Network 
System for Real-Time Adaptive EOS Data Analysis 


Principal Investigator: Dr. Hua-Kuang Liu 

Jet Propulsion Laboratory 

Co— Investigators: Professor K. Huang 

University of Southern California 

J. Diep 

Jet Propulsion Laboratory 



Summary: 

Optical data processing techniques have the inherent 
advantage of high data throughout, low weight and low power 
requirements. These features are particularly desirable for 
onboard spacecraft in-situ real-time data analysis and data 
compression applications. The proposed multi-layer optical 
holographic neural net pattern recognition technique will utilize 
the nonlinear photoref ractive devices for real-time adaptive 
learning to classify input data content and recognize unexpected 
features. Information can be stored either in analog or digital 
form in a nonlinear photof ractive device. The recording can be 
accomplished in time scales ranging from milliseconds to 
microseconds. When a system consisting of these devices is 
organized in a multi-layer structure, a feedforward neural net 
with bifurcating data classification capability is formed. The 
interdisciplinary research will involve the collaboration with 
top digital computer architecture experts at the University of 
Southern California. 
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Development of an Expert Data Reduction Assistant 


Principal Investigator: Dr. Glenn E. Miller 

Space Telescope Science Institute 


Co-Investigators: Dr. Mark D. Johnston 

Space Telescope Science Institute 

Dr. Robert J. Hanisch 

Space Telescope Science Institute 


Summary: 

We propose the development of an expert system tool for the 
management and reduction of complex data sets. The proposed work 
is an extension of a successful prototype system for the 
calibration of CCD images developed by Dr. Johnston in 1987. 

(ref. : Proceedings of the Goddard Conference on Space 
Applications of Artificial Intelligence) 

The reduction of complex multi-parameter data sets presents 
severe challenges to a scientist. Not only must a particular 
data analysis system be mastered, (e.g. IRAF/SDAS/MIDAS) , large 
amounts of data can require many days of tedious work and 
supervision by the scientist for even the most straightforward 
reductions. The proposed Expert Data Reduction Assistant will 
help the scientist overcome these obstacles by developing a 
reduction plan based on the data at hand and producing a script 
for the reduction of the data in a target common language. 

BIBLIOGRAPHY 


REFERENCES 

Yen, F. (1992) . "Draco - A Data Reduction Expert Assistant", in 
the Proceedings of the AAAI Fall Symposium on Intelligent 
Scientific Computation, Boston. 

Miller, G. and F. Yen (1992) . "The Data Reduction Expert 
Assistant" in the Proceedings of astronomy from large 
Databases II Hagenau, France, ESO. 

This last reference is an invited talk. 
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Multi-Layer Holographic Bifurcative Neural Network 
System for Real-Time Adaptive EOS Data Analysis 


Principal Investigator: Dr. Hua-Kuang Liu 

Jet Propulsion Laboratory 


BIBLIOGRAPHY 


REFERENCES 

1. Hua-Kuang Liu, "Self amplified optical pattern-recognition technique", Applied Optics . 
31 : 14 , 2568-2575 (1992). 

2. Hua-Kuang Liu, Jacob Barhen, and Nabil H. Farhat, "Optical implementation of terminal 
attractor-based associative memory", Applied Optics . 31:23,4631-4644(1992). 

3. S. Zhou, P. A. Yeh, and H. K. Liu, "Dynamic self-amplified photorefractive optical beam- 
array generation", Presented at the SPIE Annual Meeting, San Diego, July 19,1992. 

4. H.K. Lui and S. Zhou, "Complex reconfigurable free-space optical interconnection via phase 
CGH in spacial light modulators", Presented at the SPIE Annual Meeting, San Diego, 

July 19,1992. 

5. H. K. Liu, "Bifurcating optical pattern recognition in photorefractive crystals". Presented at 
the SPIE Annual Meeting, San Diego, July 19,1992. Accepted for Publication in Optics 
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VIEWCACHE: An Incremental Pointer-based Access /i j 
Method for Autonomous Interoperable Databases f ' 


Principal Investigator: Associate Professor N. Roussopoulos 

University of Maryland 

Co-Investigators: Dr. Times Sell is 

University of Maryland 

Summary: 

One of biggest problems facing NASA today is to provide 
scientists efficient access to a large number of distributed 
databases. Our pointer-based incremental database access method, 
VIEWCACHE, provides such an interface for accessing distributed 
datasets and directories. VIEWCACHE allows database browsing and 
search performing inter-database cross-referencing with no actual 
data movement between database sites. This organization and 
processing is especially suitable for managing Astrophysics 
databases which are physically distributed all over the world. 
Once the search is complete, the set of collected pointers 
pointing to the desired data are cached. VIEWCACHE includes 
spatial access methods for accessing image datasets, which 
provide much easier query formulation by referring directly to 
the image and very efficient search for objects contained within 
a two-dimensional window. We will develop and optimize a 
VIEWCACHE External Gateway Access to database management systems 
to facilitate distributed database search. 
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Advanced Data Visualization and Sensor Fusion: 
Conversion of Techniques from Medical Imaging to Earth Science 


Principal Investigator: Dr Richard C. Savage 

Hughes Aircraft Company 


Co-Investigators: Dr. Chin-Tu Chen 

University of Chicago 

Dr. Charles Pelizzari 
University of Chicago 

Dr. Veerabhadran Ramanathan 
University of Chicago 

Summary: 

Hughes Aircraft Company and the University of Chicago 
propose to transfer existing medical imaging registration 
algorithms to the area of multi-sensor data fusion. The 
University of Chicago's algorithms have been successfully 
demonstrated to provide pixel by pixel comparison capability for 
medical sensors with different characteristics. The research 
will attempt to fuse GOES, AVHRR , and SSM/I sensor data which 
will benefit a wide range of researchers. 

The algorithms will utilize data visualization and algorithm 
development tools created by Hughes in its EOSDIS prototyping. 
This will maximize the work on the fusion algorithms since 
support software (e.g. input/output routines) will already exist. 
The research will produce a portable software library with 
documentation for use by other researchers. 
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High Performance Compression of Science Data 



Principal Investigator: Dr. James A. Storer 

Brandeis University 

Co-Investigators: Dr. Martin Cohn 

Brandeis University 



Summary: 

In the future, NASA expects to gather over a tera-byte per 
day of data requiring space for levels of archival storage. Data 
compression will be a key component in systems that store this 
data (e.g., optical disk and tape) as well as in communications 
systems (both between space and Earth and between scientific 
locations on Earth) . We propose to develop algorithms that can 
be a basis for software and hardware systems that compress a wide 
variety of scientific data with different criteria for 
fidelity/bandwidth tradeoffs. The algorithmic approaches we 
consider are specially targeted for parallel computation where 
data rates of over 1 billion bits per second are achievable with 
current technology. 
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SAVS: A Space Analysis and Visualization System 


Principal Investigator: Dr. Edward P. Szuszczewicz 

Science Applications International 
Corporation 


Co-Investigators: Dr. Alan Mankofsky 

Science Applications International 
Corporation 


Dr. Charles C. Goodrich 
University of Maryland 


Summary: 

We propose to develop, test, demonstrate, and deliver to 
NASA a powerful and versatile data acquisition, manipulation, 
analysis and visualization system which will enhance scientific 
capabilities in the display and interpretation of diverse and 
distributed data within an integrated user-friendly environment. 
Our approach exploits existing technologies and combines three 
major elements into an easy-to-use interactive package: 

1) innovative visualization software, 2) advanced database 
techniques, and 3) a rich set of mathematical and image 
processing tools. Visualization capabilities will include one-, 
two—, and three-dimensional displays, along with animation, 
compression, warping and slicing functions. Analysis tools will 
include generic mathematical and statistical techniques along 
with the ability to use large scale models for interactive 
interpretation of large volume data sets. Our system will be 
implemented on Sun and DEC UNIX workstations and on the Stardent 
Graphics Supercomputer. Our final deliverable will include 
complete documentation and a NASA/NSF-CDAW/SUNDIAL campaign 
demonstration. 
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A Spatial Analysis and Modeling System (SAMS) 
for Environment Management 
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Principal Investigator: Charles H. Vermillion 

Goddard Space Flight Center 

Co-Investigators: Fran Stetina 

Goddard Space Flight Center 

Dr. John Hill 

Louisiana State University 
Dr. Paul Chan 

Science Systems and Applications, Inc. 
Robert Jaske 

Federal Emergency Management Agency 

Gilbert Rochon 
Dillard University 


Summary: 

This is a proposal to develop a uniform global environmental 
data gathering and distribution system to support the calibration 
and validation of remotely sensed data. SAMS is based on an 
enhanced version of FEMA's Integrated Emergency Management 
Information Systems and the Department of Defense's Air Land 
Battlefield Environment Software Systems. This system consists 
of state-of-the-art graphics and visualization techniques, 
simulation models, database management and expert systems for 
conducting environmental and disaster preparedness studies. This 
software package will be integrated into various Landsat and 
UNEP-GRID stations which are planned to become direct readout 
stations during the EOS timeframe. This system would be 
implemented as a pilot program to support the Tropical Rainfall 
Measuring Mission (TRMM) . This will be a joint NASA-FEMA- 
University-Industry project. 
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A Distributed Analysis and Visualization 
System for Model and Observational Data 


Principal Investigator: Professor Robert Wilhelmson 

University of Illinois 

Co-Investigators: Dr. Steven Koch 

Goddard Space Flight Center 


Summary: 

The objective of this proposal is to develop an integrated 
and distributed analysis and display software system which can be 
applied to all areas of the Earth System Science to study 
numerical model and earth observational data from storm to global 
scale. This system will be designed to be easy to use, portable, 
flexible and easily extensible and to adhere to current and 
emerging standards whenever possible. It will provide an 
environment for visualization of the massive amounts of data 
generated from satellites and other observational field 
measurements and from model simulations during or after their 
execution. Two- and three-dimensional animation will also be 
provided. This system will be based on a widely used software 
package from NASA called GEMPAK and prototype software for three- 
dimensional interactive displays built at NCSA. The underlying 
foundation of the system will be a set of software libraries 
which can be distributed across a UNIX based supercomputer and 
workstations . 
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Tuesday, August 11, 1992 


Mr. Glenn Mucklow and Mr. Joseph Bredekamp from the Information 
Systems Branch of OSSA welcomed the meeting participants to the 
second AISRP Workshop. Mr. Mucklow introduced the Workshop host, 
Dr. Randal Davis from the Laboratory for Atmospheric and Space 
Physics (LASP) , and representatives from OAST, CESDIS, and the Ames 
Research Center (ARC) . Mr. Mucklow briefly reviewed the agenda and 
noted the schedule for the workstation demonstrations during breaks 
and lunch. The Workshop Agenda and Demonstration Schedule are 
included in this Report in Appendices A and C. 

ARTIFICIAL INTELLIGENCE/EXPERT SYSTEMS/NEURAL NETWORKS SESSION 

The Chairman of first session, Dr. David Thompson from ARC, 
introduced the presentations on Artificial Intelligence, Expert 
Systems, and Neural Networks. 


Multivariate Statistical Analysis Software Technologies for 

Astrophvsical Research Involving Large Data Bases 

Dr. S. G. Djorgovski 

California Institute of Technology 

The Principal Investigator (PI), Dr. Djorgovski, and his 
collaborators are conducting a dual effort under the AISRP. They 
are developing a simple, efficient, user-friendly, interactive 
package called STATPROG for multivariate statistical analysis of 
relatively small data sets. They are also developing a large, 
complex system called FRITZ to help process and analyze large 
amounts of data (about 3 Terabytes) from the Digitized Second 
Palomar Sky Survey (POSS-2) . 

Dr. Djorgovski described the approach taken for STATPROG and its 
current status. About fifteen stand-alone programs exist and work, 
and several more are under development or testing. The package has 
been exported to two external sites for independent testing. 
Several papers based on the use of the package have been published 
or are in press. The remaining tasks are mostly documentation in 
nature. The intent is to deposit the entire package plus the 
documentation in an anonymous file transfer protocol (ftp) account 
for pickup by any interested parties. 

The FRITZ system is being developed in collaboration with the Jet 
Propulsion Laboratory (JPL) Artificial Intelligence (AI) group. It 
now runs on a Sparcstation II under the Sun Unix Operating System, 
but may be ported to a faster machine shortly. The system will 
detect objects, measure their properties, classify them, and 
catalog them. It will use the external charged coupled device 
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Discussion on key topics and issues 
continued on Wednesday evening at the 
NCAR reception 


ORIGINAL PAGE 

BLACK AND WHITE PHOTOORAPW 


Workstation demonstrations provided an 
opportunity for interaction between workshop 
attendees and tool developers 





Assistant could perform. The project was started over a year ago, 
but most of the work has been done over the past eight months. 
Removal of cosmic rays from CCD data is a prime challenge, and is 
the current focus of the Expert Assistant. The initial system has 
been developed and is being critiqued by the lead users. Based 
upon their input, the system will be revised and retested with new 
groups of users. Distribution is planned for the first quarter of 
FY 1994. There were some questions and discussion regarding 
implementation of this tool. 


SIGMA; Scientists 1 Intelligent Graphical Modeling Assistant 
Dr. Richard Keller 
Ames Research Center 

This research project is co-funded by OSSA and OAST. Model- 

building is essential for scientific advancement, yet it is time- 
consuming and error-prone. The goal of this project is to build a 
specialized software tool to assist in scientific model-building. 
The tool has been developed, but has not yet been deployed to 
support scientists in the Cassini and EOS missions. 

The two models being worked with are TGM and Forest-BGC. Dr. 
Keller discussed why these models were selected, and how the SIGMA 
approach differs from the typical manual approach to model- 
building. The three basic differences are that the knowledge base 
is made available to the system, the scientist conducts an 
interactive process at the terminal, and the result is an 
executable computation plan. As an example. Dr. Keller used the 
Titan atmospheric modeling from the Voyager Flyby. SIGMA developed 
a model for computing these elements. Details on this process are 
contained in the presentation charts in Appendix E. 

In FY 1993, the project plans to finish the Titan temperature 
determination, implement the Forest-BGC modeling scenario, build a 
graphical equation-entering facility, and deliver the first 
prototype to science users for testing and evaluation. 

In discussions. Dr. Keller noted some limits of this tool (the 
environment doesn't handle partial differential equations). In 
response to questions, Dr. Keller described the applicability of 
the tool, the testing by users, and the possibility of trying the 
tool on other models. 
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(CCD) imaging for calibration of both the photometry and object 
classification. Long term plans include the exploration and 
development of astrophysical research involving very large 
databases and the use of rapidly developing expert systems. In 
response to a question, Dr. Djorgovski indicated that there are 
plans to reanalyze previous sky surveys using these tools. 


Multi-Layer Holographic Bifurcative Neural Network for Real-Time 

Adaptive EOS Data Analysis 

Dr. Hua-Kuang Liu 

Jet Propulsion Laboratory 

Dr. Liu discussed the recent major technical achievements in this 
investigative area, as well as new technology innovations and 
publications over the past year or so. His presentation consisted 
of a theoretical discussion of the basic model and the principle of 
bifurcation, and the actual results of the experiment. Details of 
the theoretical discussions and the experiment set-up and results 
are contained in Appendix E. 

This research group has introduced a new class of optical pattern 
recognition technique which utilizes the bifurcating diffraction 
phenomenon in non-linear gain saturation memory media. Future 
research for the bifurcating optical pattern recognizer (BIOPAR) 
includes the improvement of the quality and resolution of the 
bifurcating signals and the application of the BIOPAR for Earth 
Observing System (EOS) data classification problems. 


Development of an Expert Data Reduction Assistant 
Dr. Glenn Miller 

Space Telescope Science Institute 

In this project, the investigative team focussed on the data 
reduction problem, rather than on data analysis or visualization. 
Dr. Miller described several different types of data reduction and 
analysis systems and their advantages and disadvantages. The 
Expert Assistant is an alternative approach which builds on the 
foundation of these systems and mitigates some of their 
disadvantages, particularly problems associated with data 
management and the iterative nature of the work. The goal was to 
develop a system which will act much as a graduate student: follow 
instructions on how to do data reduction; check for processing and 
data quality problems; work longer hours; and have flexibility for 
modification for new cases. 

Dr. Miller discussed the salient feature of the Expert Assistant 
and how best to involve the scientific community. Their project 
picked a group of "lead users" who had a real need that the Expert 




OAST AI Program 
Dr. David Thompson 
Ames Research Center 

Dr. Thompson (standing in for Dr. Friedland of ARC) gave an 
overview of the current OAST AI Program. This is a $13M per year 
program involving seven NASA centers. Earth and space science 
domains are a major focus of the program at ARC and JPL. The OAST 
AI Program includes the following projects: Intelligent Scientific 

Laboratory Instruments; Bayesian Data Analysis — i AutoClass; Pl-in-a- 
Box; Multi-Agent Planning for Heterogeneous Registration; Reactive 
Planning, Scheduling, and Control; Spacecraft Health Automated 
Reasoning Prototype; and Scientific Analysis Assistant. Dr. 
Thompson briefly described each of these projects and the current 
status (See Appendix E for presentation details) . All of these 
projects are intimately tied with users from the very beginning. 


DATA COMPRESSION/ARCHIVING/ACCESS/ANALYSIS SESSION 

Dr. Randal Davis chaired the session on data compression, 
archiving, access, and analysis. 


Parallel Algorithms for Data Compression 
Dr. James A. Storer 
Brandeis University 

Dr. Storer gave a brief overview of applications of data 
compression and distinguished between lossless data (decompressed 
data is identical to the original) and lossy data (decompressed 
data may be an approximation of the original) . He discussed 
systolic algorithms for lossless compression, lossless compression 
hardware, image compression visualization tools and experiments, 
image compression hardware, video displacement estimation, and 
real-time video compression hardware. However, most of the 
presentation focussed on vector quantization for image compression 
and on-line adaptive vector quantization. Dr. Storer showed slides 
of examples of compression on a medical image at 5:1, a NASA image 
at 10:1, and a visual portrait at 10:1. The compression did a 
surprisingly poor job on subtle shading on the visual portrait, and 
demonstrated the problem with visual quality on this type of image. 
Details of his presentation are included in Appendix E. 

A splinter group chaired by Dr. Storer was set up to further 
discuss the issues of data compression. 
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Performance and Scalability of Client-Server Database . Architectures 
Dr. Alex Delis 
University of Maryland 

Dr. Delis discussed ADMS , an enhanced client-server database 
architecture with incremental gateways to heterogeneous relational 
data base management systems (DBMSs). Today's needs are: f n ^ er ~ 

database querying, downloading and downsizing, inter-database 
dependency tracking and change propagation; version and change 
control; interoperability of heterogeneous relational DBMSs and 
multi-site transaction management. There are several technology 
trends that will affect the development of DBMSs, and a major 
question is how to take advantage of these technology developments 
to create DBMSs that offer fast response time and high throughput. 

Dr. Delis described the main features of the ADMS system and how it 
meets the user needs. The prototype platform for ADMS is Unix on 
Suns, DecStations, and Vaxes, with gateways for Oracle, Sybase, and 
Ingres. The ADMS enhanced client-server provides distribution of 
both processing and data, site autonomy (except for updates) , and 
minimal net traffic and overhead. Future work on the ADMS 
architecture will include: gateway query optimization; pipeline 

algorithms for interdatabase queries; adaptive update propagation 
strategies; multi-site transaction management and recovery in 
autonomous databases; an experiment with increment updates of 
mirrored databases; and applicability of the same techniques in a 
multi— processor environment with or without shared memory. 


A Land-Surface Testbed for the EOS Data In f ormation System (EOSDIS) 
Dr. William Emery 

Colorado Center for Astrodynamics Research 

The goal of this Testbed was to develop on-line AVHRR data 
distribution including collection archiving and software, emulate 
some EOSDIS functionality, and prepare snowpack and VI composite 
images. This is now called the "EOSDIS Testbed" system. 

Dr. Emery described the system— —how it is accessed, its current 
features, and plans for the future. The software is available for 
distribution for Unix and Macintosh, and PC display software has 
recently been added. The user base has expanded dramatically. 
There are about 734 users, and the user base is growing at an 
average of about four per day. The majority of users are U.S. 
educational institutions, but there is also a considerable 
government component. Dr. Emery showed a full resolution data 
example of what they started with. The image coverage was 
expanded, which created a lot of network overhead. The Navorder 
system was created for the users. The usage statistics indicated 
that users began using the browse feature in a manner that 
eliminated some of the need for data orders. 
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The project and users would like to see this system continue until 
Version 0 of EOSDIS can take over. 


Geographic Information System (GIS) for Fusion and Analysis of 

Hiah-Resolution Remote Sensing and Ground Truth Data 

Dr. A. Freeman 

Jet Propsulsion Laboratory 

Three overflights of the Flevoland calibration/agricultural site 
were made by the JPL AIRSAR, and the modified VICAR/IBIS GIS was 
used to analyze the data. VICAR (Video Image Compression and 
Retrieval) is a set of programs and procedures designed to 
facilitate the acquisition, processing, and handling of digital 
image data. VICAR/IBIS is a VICAR-based GIS which requires that 
all image data be co-registered to a georeference image. The 
modified VICAR/IBIS GIS is an extension which replaces the 
••tabular" file format with an "info" file format. The objectives 
of the modified VICAR/IBIS GIS are: to handle data in many 

different formats and from many different sources; link all data 
together through a georeference image; and track data in time, 
covert pixel values to "actual" values, plot graphs, generate 
training vectors for classification algorithms, and compare actual 
and measured parameters. 

Dr. Freeman described the implementation and results of the system 
(details are included in the presentation material in Appendix D) . 
A working GIS/Image Processing System has now been integrated. GIS 
has been exercised using multi-temporal data from the boreal forest 
test site and an agricultural site. The rain forest site has been 
classified using radar data alone. The University of California, 
Santa Barbara (UCSB) continuous and discrete canopy models have 
been integrated, and a sensitivity analysis of the models has been 
conducted. A software tool, "Light-table," has been installed for 
interacting with very large images. A model has been developed 
which estimates scattering mechanisms from radar data, and 
MACsigmaO software has been developed for release to AIRSAR data 
users. Future activities include entry of ground truth data and 
correlation with image data; complete integration of the UCSB model 
with GIS; analysis of boreal forest data; development of a new 
technique for feature selection/classification, registration of 
varying terrain height data, and development of a rain forest site 
vegetation cover map. 
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Envision: An Analysis and Display System for Large Geophysical 
Data Sets 

Dr. Kenneth Bowman 
Texas A&M University 

Envision consists of a metadata browser, a data management system, 
a set of links to feed data to existing visualization tools, and a 
set of custom designed visualization, analysis, and data 
manipulation tools. Envision requires regular nD grid type data 
which must be stored in net common data format (CDF) files. The 
grids may contain missing data or undefined regions. 

Dr. Bowman described the Envision system layout. The project has 
been concentrating on data files, user interface, and building up 
the Envision data manager. Envision is used as a metadata browser 
of netCDP files and as a metadata editor. The user interface and 
data manager allow changes and edits of metadata. The data manager 
also manages relationships between files and provides transparent 
access as a single entity to a dataset consisting of multiple 
files. Currently, most of the data management features are 
working, and user interface is being connected. In late 1992, the 
project expects release of a system with Envision data management 
facility, Envision interface, and customized connections to NCSA 
Xlmage, NCSA Collage, and some NCAR Graphics utilities. 


The following presentation on the Spatial Analysis and Modeling 
System (SAMS) was added to the agenda. This system is a practical 
application of information systems research. 

A Global Satellite Data Acquisition and Analysis System to Support 
Hydrological Modeling and Regional Climatic Change Impact Studies 
Dr. Fran Stetina 

NASA/Goddard Space Flight Center 

Dr. Stetina 1 s presentation gave on overview of the SAMS for 
environmental management. Most of the work to date has been in 
disaster management and planning. Dr. Stetina described the SAMS 
system and its design: source of data, integration of data, 
production of products, and distribution of products to users. The 
entire system is contained in one computer or group of computers 
networked together. Currently, it is in use in a number of 
countries, and a demonstration product is being used by the Corps 
of Engineers using satellite data, along with rain gage data on 
ground radar, to create more accurate rainfall maps. 

Key features of this system are: the use of expert systems to 
interpret satellite data; a classification model; and an archive 
manager which allows the model to go in and find the data it needs 
to run the model. The system is completely automated from receipt 
of data to generation of models. 
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Trends in Data Formats for the Space and Earth Sc iences 
Dr. Randal Davis 

Laboratory for Atmospheric and Space Physics 

Dr. Davis discussed the changes in archiving, distribution, and use 
of scientific data which result in increased need for better data 
formats. In the 1980's, NASA tried to move in this direction with 
pilot systems such as the Pilot Climate Data System (PCDS) , the 
Pilot Land Data System (PLDS) , the Pilot Ocean Data System (PODS) , 
and the Pilot Planetary Data System (PPDS) ; and operational systems 
such as the Astrophysics Data System (ADS) , the NASA Climate Data 
System (NCDS) , the NASA Ocean Data System (NODS), and the Planetary 
Data System (PDS) . The 1990's will bring the consolidation of 
NASA's Earth-oriented discipline data systems (NCDS, NODS, and 
PLDS) into the EOSDIS. 

In June of this year, NASA held an invitational workshop to begin 
to determine if modern data formats will meet the needs of the 
future. A number of different formats were discussed at this 
workshop, and it was concluded that good data formats are available 
for space and Earth science applications, but the relationship 
between special scientific data formats and data formats from 
general computing has to be examined further. As a result, 
developers of CDF, netCDF, and hierarchical data format (HDF) are 
examining the possibility of developing a common interface to data 
in their formats. Dr. Davis indicated that the next workshop will 
be sometime in the Fall. 

There was some discussion among the AISRP workshop attendees on the 
formats and differences within the community, and how users 
interact with this issue. A splinter group session was set-up to 
further discuss this topic and develop some recommendations. 


Wednesday, August 12 


SCIENTIFIC VISUALIZATION SESSION 

Dr. Mike Botts chaired the session on scientific visualization. 
Most of the presentations in this session were accompanied by 
visual demonstrations via computer projection screen. 
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The Grid Analysis and Display System (GrADS) 

Dr. James L. Kinter 

Center for Ocean-Land-Atmosphere (COLA) Interactions 
University of Maryland 

Dr. Kinter provided background information on the COLA group at the 
University of Maryland, and the types of research in which they are 
currently engaged. Based upon a survey of users conducted by Dr. 
Mike Botts, a number of complaints about existing scientific 
visualization tools were summarized. Most users are not interested 
in the software tools currently available for these reasons, which 
were the same problems identified by the COLA group. As a result, 
the COLA group decided to develop GrADS. They have tried to 
tightly integrate the following: access of the data; manipulation 

of the data interactively; and display of the data to the scientist 
in a way that is familiar. Other goals of this system were 
interactivity, ease of use, and the capability to generate a 
hardcopy output. 

Dr. Kinter described how GrADS was designed to meet these goals, 
how it addressed the disadvantages noted in the survey, and the 
hardware requirements for the system. An on-line demonstration was 
conducted, and an example of hardcopy output was shown. This 
system has been introduced to a group of users, and considerable 
feedback has been received which will enable future improvements. 


National Center for Atmospheric Research (NCAR) Interactive Status 

Dr. Bob Lackman 

NCAR 


Dr. Lackman discussed the collaborations with other groups since 
last year's workshop. They are using GrADS at NCAR, and with NCAR 
graphics, are concentrating on supporting the community NCAR 
graphics package with documentation and user support. The first 
NCAR graphics user conference, June 17-19, included hands-on 
training using workstations as well as presentations, 
demonstrations, and panel discussions. Dr. Lackman provided on 
overview of conference topics, and made hardcopies of the 
presentations available to workshop attendees. 

The NCAR interactive community goals are: to guarantee long-term 
support via cost recovery; provide university and non-profit 
researchers low cost visualization; and advance and support the 
scientific infrastructure through common software. NCAR graphics 
builds on existing libraries, is a single package for distribution, 
and is portable across systems. The package is set up to have 
three levels of interface: programmatic; command line; and visual 
point and click. Dr. Lackman described what NCAR interactive will 
look like. Currently, they have a functional requirements 
document, have completed preliminary design, and have a prototype 
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high level utility. Next year, they will have a command 
interpreter, and the year after that, they will be distributing 
software as part of the NCAR graphics distribution. 


A Distributed System for the Visualization and Analysis of Observed 

and Modeled Meteorological Data 

Dr. Steven Koch 

Goddard Space Flight Center 

The goal of this project, a joint effort of GSFC and the National 
Center for Supercomputing Applications (NCSA) was to create a tool 
for handling the large amounts of data generated by satellites, 
observational field programs, and model simulations; and to extend 
existing 2D mapping capabilities with new analysis functions and 
modern techniques of 3D visualization, user interaction, and 
animation. Dr. Koch discussed the project approach and its 
results. The system, called GEMVIS, maximizes the use of existing 
software; uses commercially available visualization and application 
builder tools; provides visualization and analysis capabilities in 
the areas of 3D volumes of data, evolution of data over time, and 
distributed processing; and provides a highly interactive 

environment on a single display. 

Dr. Koch and Dr. John Hagedorn conducted a demonstration of the 
GEMVIS using the Explorer map module and functions, deriving 
vorticity display. The demo included other orientations and 3D 
renderings, as well as animations of wind vectors. They discussed 
the accomplishments at GSFC and NCSA, as well as problems 
encountered and the gains and losses of 2D GEMPAK. Near term 
future work includes minor user interface improvements and fixes, 
preliminary annotation, image loop animation, and user 
documentation. They will release the software for use in the Sever 
Storms Branch of GSFC in October 1992. Long term future work 
includes a database for metadata, user interface enhancements, 
additional visualization techniques, and additional animation. 

There were questions and discussions related to the timing of 
distribution of modules and portability to systems other than SGI. 
Currently, they are now distributing it on Sun and Cray, and are 
looking at a six month window for distribution on HP and DEC. The 
problems with proprietary systems was noted and discussed. 


Planetary Data Analysis and Display System: A Version of PC-McIDAS 

Dr. San jay S. Limaye 
University of Wisconsin-Madison 

McIDAS is an evolutionary hardware/software system for Earth 
Atmospheric data in use since the mid-1970's. The planetary 
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version is aimed for analysis of primarily imaging data from space 
missions such as Voyager, Pioneer Venus, Magellan, Hubble, Mars 
Observer, and Cassini. McIDAS-X is a version for Unix workstations 
(RISC-6000, SGI, Sun) with X-Windows. Planetary PC-McIDAS 
implementation was begun under McIDAS-X. Dr. Limaye described the 
current status of system level development, new software 
applications, the data descriptor block, multiband data display and 
processing, and multispectral analysis. Dr. Limaye discussed and 
demonstrated how the optical navigation technique was used to 
successfully remove an error in roll angle to correct Voyager 
Neptune images. 


Experimenter's Laboratory for Visualized Interactive Science 

Dr. Elaine Hansen 

Colorado Space Grant Consortium 

This project was a group effort among the Colorado Space Grant 
Consortium, LASP, NCAR, GSFC, and UCSB. Dr. Hansen presented an 
overview of the program and its interactions with the user 
community. The program goals are to provide a capability that 
helps visualize data to better understand the large, complex, 
diverse, and multi-dimensional data sets; support science research 
within and across NASA science disciplines; to provide a laboratory 
that can be easily used and tailored; to provide tools at an 
affordable price; and to capitalize on existing systems techniques, 
technologies, and tools. 

Dr. Hansen introduced Dr. Margi Klemp, who discussed the usability 
analysis and software design. Usability challenges include: the 
integration of visualization, data processing, exploratory 
analysis, and communication needs into a single application; a user 
interface which will hide the complexities of the hardware and 
software required to enable these functions; the flexibility to 
make changes dynamically in the applications and to add user 
specific functionality; input of diverse data formats; annotation; 
and direct manipulation interfaces for interaction with data. Dr. 
Klemp reviewed the project accomplishments and status to date. 
Future work includes completion of the Alpha version by the end of 
the year, the continuation of user testing, the integration of the 
spreadsheet engine, the development of direct manipulation user 
interface, new graphics capabilities, enhancement of TAE+, hooks 
for calling ext ernal functions, development of a direct 
manipulation annotation editor, color hard copy interface, and the 
creation of an interface to GL for hardware rendering. 

Dr. Bill Boyd showed a slide demonstration of the PolyPaint 
functionality, which renders datasets as 3D surfaces. This system 
provides the capability to look at several features at once, and 
can facilitate meteorological research. The project is trying to 
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get these tools into scientists' hands to enable them to 
incorporate this into the research cycle. 


SAVS: A Space Data Analysis and Visua lization System 
Dr. E. Szuszczewicz 

Laboratory for Atmospheric and Space Science 
Science Applications International Corporation 

SAVS is a combined effort among 3AIC, Advanced Visual Systems 
(AVS) , and the University of Maryland. The focus of this system is 
on the multi-disciplinary databases designed to understand the 
cause-effect relationships in the solar-terrestrial system and 
their extrapolations to other planetary bodies. The major 
components of the system are: innovative visualization software 
(AVS); advanced database techniques; a set of mathematical, 
analytical, and image processing tools; and a strongly developed 
sense of scientific requirements. 

Dr. Szuszczewicz discussed the needs of the practicing scientist 
and how the SAVS attempts to meet those needs. The visualization 
system is wrapped around AVS, which provides a variety of tools for 
rendering volume data. However, AVS is just one component in the 
overall system. The SAVS design goals focus on ease and 
functionality. Dr. Szuszczewicz discussed the interactive 
functionality of the SAVS system and demonstrated this capability 
on-line. During this first year of funding, the AVS has been 
ported to lower-end platforms, and an extensible user-friendly 
architecture and data and model interface modules have been 
developed. Basic mathematical and statistical functions have been 
implemented, and the development of hooks for an interactive 
interpreter has been started. The system has been tested on the 
Combined Radiation Release Experiment Satellite (CRRES) and 
International Sun Earth Explorer (ISEE) orbits and local data 
bases, and plans have been initiated for remote data access 
capabilities. In response to a question from the audience 
regarding documentation, Dr. Szuszczewicz indicated that there will 
be a year-end report as will as NASA press releases and science 
applications reports. There was some discussion relative to 
strategies in working with vendors to accommodate proprietary 
system issues. Mr. Bredekamp noted that one of the goals of the 
program is to engage the vendor community and work licensing 
agreements for the benefit of science users. Some of the program 
participants are currently working with vendors on this issue. 
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NASA Supported Advanced Visualization Techniques 
Dr. Bill Hibbard 

University of Wisconsin-Madison 

Dr. Hibbard discussed VIS-5D and VIS-AD. VIS-5D (Visualization of 
5-Dimensional data sets) is used to visualize large output data 
sets from numerical weather models. This tool is about four years 
old, and is currently used by scientists at the University of 
Wisconsin (UW) , NASA/MSFC, NASA/GSFC, and other sites. It runs on 
SGI, IBM RISC, and Stardent, and is available as freeware by 
anonymous ftp. VIS-AD (Visualization for Algorithm Development) is 
a visual laboratory for experimenting with algorithms for 
extracting useful information from remote sensing data. It puts 
visualization where people work on developing algorithms for 
automatic processing. 

Dr. Hibbard discussed how the user can utilize VIS-AD in a highly 
interactive environment. He showed how VIS-AD is used in 
construction of algorithm. As part of his presentation, he gave a 
demonstration of VIS-5D and its interactive capabilities. VIS-AD 
can also be viewed as a McIDAS macro language where the user can 
invent data structures as needed, display any data object easily, 
and invent display frames of reference. VIS-AD can access McIDAS 
data structures. Dr. Hibbard noted that anyone who is interested 
in using this tool can contact him for instructions on how to 
obtain it. 


LinkWinds; The Linked Windows Interactive Data System 
Dr. Allan S. Jacobson 
Jet Propulsion Laboratory 

The objectives of this project were to: develop a software 

environment to support the rapid prototyping and execution of data 
analysis/visualization applications; provide a suite of tools to 
interactively visualize, explore, and analyze large multivariate 
and multidisciplinary data sets? develop a user interface which 
allows maximum data and tools accessibility with a minimum of 
training? and provide system design and tools to make the 
environment accessible to application development by users. Dr. 
Jacobson described the system and how it is implemented, 
emphasizing that the scientists drive the development of the 
system. 

LinkWinds has standardized on HDT, but other data formats are 
accessible via interaction with DataHub. A user has to edit only 
three files to use the datab ase. Dr. Jacobson conducted a 
demonstration of LinkWinds from ozone and water vapor data from the 
Microwave Limb Sounder daily files. Currently, LinkWinds has only 
been distributed to a few locations outside of JPL. More 
sophisticated tools are needed and are planned. Future plans are 
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to port LinkWinds to other Unix platforms, expand the standard 
input data formats via interaction with DataHub, and develop 
applications for visual data selection and subsetting. They also 
plan to implement an applications generator to support user 
applications development, and affect a wider distribution of 
LinkWinds . 


DataHub: Knowledge-Based Science Data Management 

Dr. Tom Handley 

Jet Propulsion Laboratory 

Dr. Handley discussed the key data base issues which had been 
identified by Dr. Jeff Dozier of the EOS Program. Scientists want 
to query and analyze a variety of different types of data. 
Analysts and modelers need access to stored satellite data, derived 
products, and model output on numerous and large objects, with data 
distribution among nine archive centers. Researchers want to 
render data on local workstations to conduct their research. There 
are a number of challenges to meeting all of these needs. DataHub 
is a value-added, knowledge-based server between the data suppliers 
and the data consumers. It meets the need to address the barriers 
associated with distributed, autonomous, heterogeneous systems. 

Dr. Handley described the relationship between DataHub and 
LinkWinds, and discussed the functional architecture, the data 
model, the software architecture and implementation, and the user 
created datasets. To date, the project has defined a general 
framework for science data management, identified a critical subset 
of data operations for the science data visualization applications, 
and developed an initial prototype (DataHub 0.3) with common user 
interaction, data format conversions, user inventory management, a 
DataHub/LinkWinds interface, and underlying object-oriented 
structure and methods. After Dr. Handley discussed current status 
and future plans, there were some questions regarding how this 
could be used in the EOS system, and some interest was expressed in 
bringing this before the EOS advisory committee. 


Visualization Discussion 
Dr. Michael Botts 

University of Alabama in Huntsville 

Dr. Botts has been on a special assignment to NASA Headquarters as 
a result of Congressional attention regarding visualization on the 
EOS mission. He made his final report, "The State of Scientific 
Visualization with Regard to the NASA EOS Mission to Planet Earth," 
available to the meeting participants, and presented its findings 
as well as some options for meeting the visualization requirements 
of the EOS mission. 
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A survey of Earth scientists was conducted relative to problems 
with current visualization tools. The applications of 
visualization are: scientific investigation; data validation; 
model and algorithm development and validation; data browse; 
information transfer; and mission operations. The primary use for 
visualization is for scientific investigation. Dr. Boots discussed 
the variety of reasons why scientists are not using the 
capabilities that are available to them today. Most of the 
visualization bottleneck is related to lack of adequate software 
which allows the scientist to take advantage of hardware power and 
to interactively visualize and analyze the data. It is 
questionable whether Commercial-Of f-The-Shelf (COTS) software will 
be adequate for meeting all the needs of EOS. Dr. Botts discussed 
the advantages and disadvantages of COTS, public domain, and in- 
house development software. 

With regard to visualization, the development environment within 
OSSA can be characterized as fragmented, and lacking adequate 
organization and funding structures as well as adequate mechanisms 
for technology transfer both within OSSA and between OSSA and OAST. 
Dr. Botts identified several general areas presently needing 
additional consideration and concentration. His report concluded 
that visualization is vital for meeting the scientific objectives 
of the EOS mission, and the applications software for putting 
visualization techniques and capabilities into the hands of the 
scientist are at present inadequate. Although there is increasing 
probability that COTS software can serve as a core for meeting many 
visualization needs, there will still be a need for in-house 
development efforts focussed on extending or modifying these tools 
to meet application-specific requirements. 

Dr. Botts presented some ideas on what could be done to increase 
the effective use of visualization and analysis tools. Items 
discussed were: the possibility of a visualization/analysis 
working group; assistance centers, located at one or more 
Distributed Active Archive Centers (DAACs) ; use of pathfinders; 
vendor programs to improve the success of COTS ; improvements to the 
licensing/procurement process; and improvements to publishing and 
remote interaction. 

In the discussion period, it was noted that all of the comments are 
valid for other disciplines as well as the Earth science and 
applications field. Inasmuch as there is natural resistance to 
doing the "learning" required to utilize some of these tools, it 
would be worthwhile to have support for the learning cycle. An 
infrastructure like this workshop could facilitate discussions on 
how to solve problems. There was some discussion on the 
"assistance centers," which were generally seen as a good idea if 
they could be staffed with diverse expertise. It was felt that 
collocating an assistance center with a DAAC would be essential to 
making it work. One of the challenges will be forging a strong 
link between the Information System Branch program and the EOS 
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program. Other aspects discussed by the workshop participants 
were: the "right way" to do software support; commercialization, 
the technology transfer issue and incentives to creative 
development; the problem of getting scientists to use the tools; 
mechanism for getting support for multi-discipline, non-project 
specific aspects of the program. 

One of the suggestions made for getting tools out to the various 
communities was for the participants in this group to go to other 
external well-established society meetings, such as the American 
Geophysical Union (AGU) and give presentations and demonstrations. 
Publication in professional journals was another suggestion. 
Another possibility that was discussed was a NASA Research 
Announcement (NRA) for investigators to propose applications for 
existing tools. These would provide some "seed" money to try out 
the tools within the scientific community. This type of mechanism 
is already a part of the EOS program. Another idea was for a 
"consumer's guide" on current tools, directed to the scientist. 


Thursday, August 13, 1992 

Before beginning the morning session, the Chairman Mr. Mucklow 
announced adjustments to the agenda for the day. 


PROGRAMMATIC ISSUES AND ANSWERS SESSION 

Information Systems Research and Technology Reports 

Mr. Glenn Mucklow 

NASA Headquarters, OSSA 

The objectives of this program are to apply advanced information 
systems technology to improve support to OSSA science programs, and 
enable continual evolution of OSSA data systems environment and 
supporting infrastructure. The elements of this program include 
the Investigator Working Groups (IWG) , applied research, technology 
development, and systems evolution. Mr. Mucklow discussed the 
approach for each of these elements. The technology transfer 
element is the key to the program. It includes testbeds, software 
support, strategic users, commercial aspects, and COSMIC, and seeks 
to bridge the gap between research and technology development and 
mission operations and post-mission science research. 
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Electronic Mail and News Groups 

Mr. Randy Barth 

Goddard Space Flight Center 

Mr. Barth provided an overview of the NASA POBox and how to use it. 
The NASA POBox simplifies mail exchange among AISRP researchers on 
various systems (the Internet, NSI-DECnet, RSCS Networks, X.25 
Networks/Nodes, and LAN & PCs connected to these systems). 
Currently, there are three AISRP group distribution lists: aisrp- 

members, for all members; aispr-pi for all Pis; and aisrp-general , 
for general discussions. The NASA Science Internet (NSI) Help Desk 
can answer questions on the use of the NASA POBox. 

There are several mechanisms for electronic discussion groups: 
discussion groups vis a central bulletin board; discussion groups 
via USENet NEWS, and discussion groups via Electronic Mail. Mr. 
Barth discussed the advantages and disadvantages of each of these 
mechanisms . 


NASA Science Internet Developments 
Dr. Christine Falsetti 
Ames Research Center 

Internet, the largest Federal network, is a collection of various 
networks. The NSI was established to provide communications to 
NASA's OSSA. It provides computer networking services, management 
and operations support, and technical assistance to authorized 
users throughout NASA centers and research institutions worldwide. 
NSI's goal is to provide a high-speed communications network that 
connects all space scientists, providing ready access to data and 
information stored anywhere in the world. 

Dr. Falsetti described the current telecommunications 
infrastructure and some of the evolving network-based applications. 
The plan for evolution is to use the networks in place and use 
advance technologies to go from 45 mbps to 2488 mbps. The National 
Research and Education Network (managed out of OAST) is leading 
this activity. NASA and the Department of Energy (DOE) have 
collaborated in a procurement to obtain advanced technology, which 
will enable universities to have better access to the service. In 
response to questions regarding connections to service, Dr. 
Falsetti advised participants to contact their regional service and 
find out what their plans are to connect to the network. NSI will 
serve as the backbone to EOSDIS. 

There was some discussion regarding the funding of regionals, and 
the prospective commercialization of these services, and how 
university users would be affected. Dr. Falsetti indicated that 
there are potential problems in this area, and this issue is 
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currently in heavy debate. Further discussion on this subject was 
deferred to a later group discussion. 


CESDIS: Center of Excellence in Space Data and Information 

Sciences 

Dr. Ray Miller 

NASA/GSFC 

The CESDIS mission is to bring together computer scientists from 
university, industrial, and government laboratories to: conduct 

computer science research having application to Earth and space 
science; focus attention on accessing, processing, and analyzing 
data from space observing system; and collaborate with NASA space 
and Earth scientists. CESDIS is managed by the Universities Space 
Research Association (USRA) . Dr. Miller described the CESDIS 
organization, activities and tasks, and the research currently 
being conducted by the CESDIS staff. 

The Stanford University project has been developing software to 
automate the analysis of global auroral images obtained from 
Dynamics Explorer (DE)-l and Viking satellites. The Duke 
University project is involved with parallel compression of space 
and Earth data. The AISRP Projects are being reported upon at this 
Workshop. Dr. Miller described other additional research tasks, as 
well as consultants and fellowships that are being funded through 
CESDIS. CESDIS has also conducted annual workshops on various data 
and information systems topics. CESDIS provides support to peer 
reviews for NRA's, organizes the "Advances in Computational 
Sciences Seminar Series," and provides support to the Minority 
University Space Interdisciplinary Network project. Future CESDIS 
direction include more coordination of AISRP projects, assistance 
with Earth and space science HPCC projects, and the building of 
ties with EOSDIS. 


SCIENCE DISCIPLINE DISCUSSIONS 

Dr. Amy Walton chaired the session on the science discipline 
discussions. Panel members were: Dr. Tom Ayres, Astrophysics; Dr. 

Mike Botts, Earth Science; Dr. Jim Willett, Space Physics; Dr. 
Robert Jackson, Life Sciences; and Dr. Steven Lee, Solar System 
Exploration. Each panel member made brief opening remarks prior to 
the general discussions. 

Astrophysics 
Dr. Tom Ayres 

Astrophysics has an unusual type of data set — the images are 
relatively sparse and easily compressed. Except for solar physics, 
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the data sets are not large. However, there are many different 
platforms and missions, and many different interfaces into the data 
set as well as a ground-based connection. In terms of bottlenecks, 
one of the major concerns is that NASA is not providing enough 
funds to procure new workstations, which are needed for more disk 
space and faster CPU. One of the most valuable tools is a screen 
editor, and the next most valuable is a flexible software 
environment. To implement research, scientists in this discipline 
need transparent access to existing and new data sets, access to 
archival data, and easily modifiable software. In parallel, they 
need access to expertise and specific software (or software 
modules). The researchers do not need a proliferation of 
sophisticated visualization systems, but tools that can be easily 
modified to attack specific problems. 

In the panel discussion, Dr. Ayres emphasized that the need for 
interdisciplinary science sharing has increased, and the PDS has 
had an influence on astrophysics. In astrophysics, the primary 
science problem is not enough telescopes in orbit. The problems 
some disciplines are encountering with large data sets has not been 
an issue for astrophysics. 


Earth Science 
Dr. Mike Botts 

The major issues within Earth science involve large data sets, and 
lots of data, and interuse of multiple data sets from multiple 
disciplines. In large data sets, the issues are: keeping 
interactivity in visualization (adaptive sampling could help) ; 
automatic feature recognition and tracking; and data compression 
(lossy vs. lossless - how much is acceptable). With the 
restructuring on EOS, there is less capability to do on-board 
processing. In the interuse of data sets from multiple 
disciplines, there are navigation, gridding, and projection issues. 
Dr. Botts recommended leaving gridding and navigation till as late 
as possible, but tools will be needed to do this. The advantages 
and disadvantages of adaptive sampling were also discussed. 

The general discussion centered around data formats. A suggestion 
was made to have a limited library of formats for choice by users, 
and visualization tools would be responsive to this limited library 
of formats. It was noted that equal emphasis is needed on the 
analysis process, and one approach is to get scientists involved 
very early in the process. Dr. Mucklow indicated that there is a 
mechanism for doing this. 
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Space Physics 
Dr. Jim Willett 

Dr. Willett indicated that this workshop had been very useful and 
helpful, particularly the opportunities to talk with other 
participants and share ideas. He noted that the tools produced by 
the scientist with the help of computer scientists are the ones 
that are most successful and are actually being used in research. 
It would be useful for someone to put together a "consumer report" 
document to categorize all of .the tools/ systems, describe salient 
features, and show how they fit together. The workshop 
participants need such a document to go into their discipline 
communities and talk about what's available. 

With regard to the earlier discussion on the problem of getting 
tools into the community, Dr. Willett noted that in space physics, 
the younger investigators used and recognized the value of new 
tools, particularly visualization tools. Some of the tools 
demonstrated at this workshop might be very useful in displaying 
magnetospheric modeling. One possible approach might be to 
initiate a joint-funded activity (space physics and information 
systems) to pursue implementation of these tools, which could also 
help with technology transfer. A suggestion was made to consider 
having a technology transfer supplement to the education outreach 
grants . 

In response to a question, Dr. Willett stated that the National 
Space Science Data Center (NSSDC) is a critical element in the 
space physics and other discipline data systems. It would be a 
node on the space physics data system, and provide a master catalog 
and a deep archiving capability. There was some discussion of the 
current OSSA data policy, and the evolution to this policy by all 
space science disciplines. Currently, space physics is the 
discipline most firmly entrenched in the old system (data 
proprietary to the PI for a period of time) . It was noted that 
there is current disarray on how this issue is handled in OSSA. 


Life Sciences 
Dr. Robert Jackson 

Dr. Jackson described the Life Sciences program for Space Station 
Freedom, and two basic research scenarios — automatic operation and 
crew assisted operation. The automatic operation mode provides 
routine operation of major facility equipment to support 
biospecimen growth and development. The bottlenecks in this mode 
are primarily limited downlink bandwidth, downlink interruptions, 
and no on-board communications outage recorder. Researchers need 
tailored bandwidth reduction techniques and efficient on-board data 
processors and storage. The crew operated mode provides for crew 
to set up runs, manipulate specimens, collect data, collect and 
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analyze samples, and maintain equipment. The bottlenecks in this 
mode include crew time and skill maintenance, limited uplink and 
downlink bandwidths and interruptions, late delivery of data and 
samples, and long storage time of samples. Research is needed to 
enable effective conduct of advanced experiments to accommodate 
limited uplink and downlink, minimize the demand for power and 
volume, and maintain and improve crew skills. 

In response to a question about metadata, Dr. Jackson that it has 
been inadequate, and life sciences is just starting to begin an 
archive. They will need help with this development, and the 
implementation to bring it on-line. Recoverable archive remains 
one of the major problems. Tools are needed to store more 
information up-front. The Pl-in-a-box concept was discussed as a 
potentially useful tool for life sciences. 


Solar System Exploration 
Dr. Steven Lee 

Dr. Lee provided a brief background of the PDS and its structure. 
JPL provides the management and top-level catalog. Colorado is in 
the Atmospheres Node. Each discipline node divides up among 
specialty nodes. Problem areas in archiving planetary data from 
active missions are: the obligation of flight projects to archive 
data; preservation of mission funds for data archiving; 
availability of project personnel for archiving tasks; definition 
of interfaces between projects and PDS; large lead times needed to 
influence archiving plans; selection of storage technology; massive 
data volumes; and the proprietary period and scheduling of data 
transfer. PDS is actively working all of these areas. 

The data formats issue came up again during discussion of the PDS. 
Dr. Davis added that the working group has come to the conclusion 
that formats should not be prescribed; a more workable approach is 
to described a standard set of data objects, and encourage users to 
accept those standards. It was noted that the tools demonstrated 
at the workshop have applicability to the planetary research 
discipline. Another issue discussed was how to get rid of old data 
where the cost of maintaining the data for exceeds its usefulness. 
Space physics is currently looking into this problem. 


TECHNOLOGY TRANSFER DISCUSSION 

Dr. Tom Handley and Dr. Larry Preheim led the group discussion on 
technology transfer. 

For the purposes of this workshop discussion, technology transfer 
was defined as the transfer of organized knowledge to a project or 
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program for the eventual purpose of producing new or improved 
products, processes, or services. The transfer can occur through 
consulting, documentation, training, demonstration, or 
collaborative technical work. Too often research and development 
has been content to "throw its product over the wall and hope 
someone will catch it." Technology issues are viewed differently 
by advanced development groups and implementation/production 
groups. Dr. Handley discussed some of the barriers to technology 
transfer. The user community lacks a process to identify common 
requirements, and a lacks a vehicle to exert leverage on NASA to 
implement common designs. The resources invested in existing 
system and applications, as well as the attitude and culture of the 
work force, make it difficult to evolve to new technologies. In 
addition, there are inadequate incentives to foster the insertion 
of new technology into new missions, and there is a fear of not 
being able to meet mission objectives (performance and budget) 
using "newer" technology. Overall, there is no documented coherent 
NASA vision for broad-based technology integration or a technology 
transfer process. 

It was noted that OSSA and OAST have been trying to work out a 
process to address some of these barriers and develop a coherent 
NASA vision. One example of good collaborative effort is HPCC; the 
key was participation and drive from the onset of the project. 
Dissemination of information alone does not produce results. 
Technology transfer occurs within the context of one-on-one 
relationships of technologists and organizations. The programmatic 
challenge is to establish these relationships. JPL is putting 
together a process for technology transfer at that institution. It 
includes a technology transfer readiness review of on-going and new 
programs. However, the dominant project individuals need to be 
advocates of technology transfer and keep the user community 
involved in the process. A flow diagram of the process is included 
in the presentation package in Appendix D. The advantages of 
collocation were discussed, and it was generally agreed that 
directly collaboration, particularly at the beginning of the 
process, is optimum. An intermediate "technology transfer" group 
can be used to sustain the activity once it is well started. 

Another issues that was discussed at some length was how to support 
the transfer once it is successful. Support can be time consuming 
and expensive, and things that need a lot of "handholding" tend to 
stay in research labs for this reason. Dr. Handley noted that 
there are several types of transfer: a single, stand-alone tool 
for a group of users; a tool which is an integral part of a larger 
system (e.g. shuttle scheduling) to other users; and technology 
which has only "potential" users. 

How to move commercial products into the NASA community was another 
popular topic, and the participants discussed the issues with 
licensing. Dr. Botts indicated that he has been working on this 
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issue, and invited interested parties to get in touch with him to 
discuss it further. 

Dr. Preheim presented a view of an "optimal solution:" establish 
three demonstration sites, GSFC, JPL, and LASP, where prospective 
users/interested parties could visit, view the tools, and have 
expertise there to walk them through the products. However, there 
would need to be coordinated testing for maturity and useability of 
the technology. One center (JPL) would be the focal point to 
address commercialization issues. This idea was discussed among 
workshop participants, who saw value in this type of approach. 
However, it was noted that users would need to know the 
availability of products at the center, and a "consumer's guide" 
was again suggested as a way to fulfill this need. 

Mr. Mucklow indicated that by this time next year, the AISRP 
investigators will need to identify how they will do technology 
transfer. A suggestion was made to have an archive site (e.g., 
LASP) for the software that was described and/or demonstrated at 
this workshop. In response to a suggestion regarding 
demonstrations in a special session at AGU, Mr. Mucklow asked that 
anyone interested in doing this, please contact him and it would be 
pursued . 


COLLABORATION AND COMMUNICATION DISCUSSIONS 

Mr. Mucklow, Dr. Willett, Dr. Gordon Johnston, and Dr. Peter 
Friedland led the group discussions on collaboration and 
communications . 

Dr. Johnston, the thrust manager for science in OAST, explained how 
OAST has been restructured from discipline-oriented management to 
the focused thrust structure consisting of science, exploration, 
and operations. Dr. Peter Friedland from ARC talked about the OAST 
culture, which is significantly different from OSSA — it is one of 
research of technology to facilitate NASA missions. In OAST it is 
unusual to have large scale peer reviews — proposals are often 
unsolicited, and are funded through the field centers. The AI 
program was advocated and approved as a mission-oriented thrust. 
The goal in the OAST program is to do a great deal of technology 
transfer at all levels, and the program office supports 
collaborative efforts with OSSA — in advocating, funding, and 
working. With respect to the issue of technology transfer on 
flight project, Dr. Friedland acknowledged that technology transfer 
is more difficult. However, technology transfer is easier to 
accomplish when connected with an experiment instead of part of the 
flight transportation system (e.g. the shuttle) . A good example of 
this was the PI-in-a-Box. 
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The group discussed the possibility of more joint NRA's. There was 
general consensus that this approach would be very attractive to 
the scientific community. OSSA hopes to increase the CESDIS 
activity to provide a mechanism for OAST and space scientists to 
work together. In response to a comment regarding funding for 
support to space science, Dr. Willett noted that the issue of 
science support funding has come up repeatedly in OSSA and in the 
Space Physics Division, but most of the activity in the office is 
toward getting new projects to fly. Mission operations and data 
analysis (MO&DA) funding is dependent on flight missions. The 
office would like to get some sort of statement regarding the right 
balance among funding to the science community, funding to 
hardware, and funding to extend missions. Funding was clearly 
recognized as a key issue, but it was also recognized as something 
that, to a large extent, is beyond NASA's control, and is driven by 
national priorities. What the community can do is work through the 
established advisory committees and have enough small scale 
missions to provide some support for R&D. The National Academy of 
Sciences (NAS) and the National Research Council (NRC) are the 
recognized advocates for science, and the community needs to exert 
influence through this structure. 

Dr. Willett noted that a mechanism for technology transfer of 
visualization tools could be an NRA type of activity where funding 
is provided tc selected groups to try out the tools and bring in 
the technology. OAST is trying to reduce the cost of mission 
operations, which could make some more funding available for 
science analysis. 


OPPORTUNITIES FOR FUTURE AISRP RESEARCH DISCUSSION 

Mr. Mucklow led the wrap-up discussion for the workshop. He noted 
that HPCC is currently under evaluation, and four to five major 
awards will result. This will generate some massively parallel 
processing testbeds. In addition, OAST is sponsoring an HPCC 
CESDIS announcement. OSSA has funding in FY 1994 to fund 
additional AISRP grants, and will have an announcement coming out 
in 1993. As a result of these workshop discussion and lessons 
learned, the program office will try to have more focussed topics 
of research. OAST could help provide an opportunity for some of 
the creative proposals that fall outside the scope of the next 
announcement . 

Some key discussion areas identified during the workshop were 
issues associated with data compression and analysis, data formats, 
and technology transfer. 

Dr. Davis, Dr. Storer, and Dr. Falsetti summarized their splinter 
group findings on data formats, data compression, and networking. 
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REPORT OF SPLINTER SESSION ON DATA FORMATS 
Dr. Randal Davis 

No one data format can do it all, even within a single discipline. 
Factors to consider in selecting data formats are: usability, 
performance, ease/difficulty of change, completeness of metadata, 
and support for data compression. Software must be provided for 
every data format. At a minimum, it roust display any header or 
label information, create a programming language data structure, 
and put the data into a "plain" file. A "consumer reports" kind of 
analysis is needed to evaluate and compare formats. Data formats 
should be "bottom-up" development. Some concern was raised about 
"top down" data formats standards efforts, such as the SFDU 
concept. 

Specific needs differ according to the type of data being handled: 
archive data, active data, quick look/browse data, and operational 
data. Archive data needs lots of metadata, minimal . system 
dependencies, and lossless compression. Metadata for active data 
is dictated by need and system dependencies may be acceptable. 
Quick look/browse may not need much metadata, and system 
dependencies and lossy compression may be acceptable. Operational 
messages require minimal metadata and are heavily encoded. 


REPORT OF SPLINTER SESSION ON DATA COMPRESSION 
Dr. James Storer 

NASA needs for data compression are: archiving of data; 
transmitting data more quickly between NASA centers and research 
institutions; data distribution on CD-ROM, etc.; fast data 
browsing. It is clear that with over a tera-byte of data per day 
to be archived in the future, fast ways to "finger through" the 
data are critical. 

The tradeoff between storing less data or having more data with 
less fidelity for specific key NASA applications needs to be 
studied. Better measures of distortion must be developed that can 
evaluate compression algorithms for specific applications and which 
allow the user to quantitatively specify a tradeoff between 
compression and fidelity. Note that such measures are also 
critical for compression algorithms to adaptively "learn" about the 
data. Continued research is needed on general purpose adaptive 
techniques that work well for the wide variety of NASA data and 
have an easily "adjustable" compression-fidelity tradeoff. Fast 
algorithms (and hardware) for real-time data compression are needed 
so that compression and decompression can be "invisible" to the 
user. Integration of data compression software and hardware into 
current and proposed NASA archives and distribution systems needs 
to be studied. Better communication between Earth scientists and 
computer scientists is needed so that compression systems that are 
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a good "match" to the applications can be developed, and so that 
Earth scientists are comfortable using these systems. 

Gathering data and transmitting data from remote sensors is an 
important issue for NASA, and one for which compression can play an 
important role by effectively increasing the bandwidth of 
communication channels. However, due to the many highly technical 
issues involved, it is reasonable to view it as a separate problem 
from the compression of data that has already been obtained. 


One action that was suggested by the participants was to identify 
some data sets on which to do a data compression exercise. This 
could be worked on at the Data Compression Workshop in the spring. 
Mr. Mucklow took the action to follow up on this. 


REPORT OF SPLINTER SESSION ON NETWORKING 
Dr. Christine Falsetti 

Issues identified and discussed were: 

- Evolving NSFNET infrastructure and support for NASA PIT's at 
NSFNET regional institutions. What is the impact on NASA Pi's? 
What reponsibility will NASA take to support NASA Pi's? What are 
our plans for support? How does NSI track NASA Pi's? 

- WAN-LAN roles and responsibilities and interface issues at 
sites. As WAN upgrades to high performance network delivery, what 
are the implications at the science site? Communities must be 
alerted that they should be working with LAN providers to ensure 
that they will be able to take advantage of the high-performance 
network when deployed. 

Request for more information about NSI at SWG's, 
particularly EOSDIS Data Panel meetings. 

- How to better listen to the science community. Perhaps 
volunteers for involvement in restructured scientist input (SSC) 
activities? 


A final action on the workshop participants in general was to pass 
along any items of group interest via electronic mail; if paper, 
pass along to Mr. Mucklow for distribution. 
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Tuesday. August 11 
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8:30 Opening Remarks: Scope of Workshop 

J. Bredekamp/NASA Office of Space Science and Applications 
G. Mucklow/NASA Office of Space Science and Applications 
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Chaired by P. Friedland 
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Astrophysical Research Involving Large Data Bases 

G. Djorgovski/California Institute of Technology 
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G. Miller/Space Telescope Science Institute 
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R. Keller/ Ames Research Center 
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Artificial Intelligence in Earth and Space Science 
D. Thompson/ Ames Research Center 
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DATA COMPRESS ION/ ARCHIVING/ ACCESS/ ANALY SIS SESSION 
Chaired by R. Davis 

1:30 pm High Performance Compression of Science Data 
J. Storer/Brandeis University 
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Wednesday. August 12 
8 : 00 a.m. Coffee 

SCIENTIFIC VISUALIZATION SESSION 
Chaired by M. Botts 

8:30 The Grid Analysis and Display System (GRADS): 

A Practical Tool for E a rth Science Visualization 
j. Kinter/University of Maryland 
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R. Lackman/National Center for Atmospheric Research 
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10:00 
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Adjourn to Reception 
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Thursday. August 13. 1992 
8:00 a.m. Coffee 

PROGRAMMATIC ISSUES AND ANSWERS SESSION 
Chaired by G. Mucklow 
8:30 Opening Remarks 

G. Mucklow/NASA Office of Space Science and Applications 

9:00 NASA Science Internet (NSI) Developments 

C. Falsetti/Ames Research Center 

9:30 Center of Excellence in Space Data and Information 

Sciences (CESDIS) Role 
R. Miller/CESDIS 

10:00 BREAK and DEMONSTRATIONS 

10:30 Science Discipline Discussion : Bottlenecks and Problem Areas 

in Earth and Space Science Processes 
Chaired by A. Walton 

Astrophysics: T. Ayres/University of Colorado 

Earth Science: M. Botts/University of Alabama, Huntsville 

Space Physics: J. Willett/NASA Office of Space Science 

and Applications 

Life Sciences: R. Jackson/ Ames Research Center 

Solar System Exploration: S. Lee/University of Colorado 
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1:30 Opportunities for Future AISRP Research Discussion 

G. Mucklow/NASA Office of Space Science and Applications 

2:00 Technology Transfer Discussion : Test Beds, 

Demonstrations, Availability/Distribution, User Experience, 
Software Support, COSMIC Role, Development of Data Test 
Suites to Evaluate Tools 

T. Handley/Jet Propulsion Laboratory 

3:00 BREAK and DEMONSTRATIONS 

3:30 Collaboration and Communication Discussion : 

Interaction of Computer Science Investigators with Earth and 

Space Science Investigators 

G. Mucklow/NASA Office of Space Science and Applications 
G. Johnston/NASA Office of Aeronautics and Space 
Technology 

J. Giffin/NASA Office of Aeronautics and Space Technology 

4:30 Summary and Action Items 

G. Mucklow/NASA Office of Space Science and Applications 

5:00 Adjourn 
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APPENDIX C: 
Demonstrations 




APPENDIX C 


APPLIED INFORMATION SYSTEMS RESEARCH PROGRAM (AISRP) 

WORKSHOP II 

Laboratory for Atmospheric and Space Physics 
Space Technology Research Building 
University of Colorado 
Research Park 
Boulder , CO 

AUGUST 11-13, 1992 

DEMONSTRATION SCHEDULE 


igust 11, 1992 

10:00 Astrophysics demo 
DECstation 


Alice Bertini, U of Colorado 


Polypaint : Experimenter ' s Lab 
for Visualized Interactive 
Science 
SGI Indigo 

Distributed Analysis and 
Visualization Systems for 
Model and Observational Data 
SGI Crimson 


Bill Boyd, NCAR 

Allison Kipple, U of Colorado 


Steve Koch, GSFC 
John Hagedorn, GSFC 
Matt Arrott , NCSA 


12:00 Tool for Planetary Atmospheric Rich Keller, Ames Research 
Modeling 
SPARCstaton 


GRADS Grid Analysis and 
Display System 
DECstation 

VIS5D and VISAD 
SGI Crimson 

3:00 SAVS: Space Analysis and 
Visualization System 
DECstation 
IBM RS 6000 

Land-Surface Testbed for 3 
DECstation 


Brian Doty, U of Maryland 

William Hibbard, U of Wisconsin 

Alan Mankofsky, SAIC 

Charles Goodrich, U of Maryland 

Tim Kelly, U of Colorado 


5:00 Interactive Environment for Keith Searight, U of Illinois 
Analysis of Large Earth 
Observation and Model Data Sets 
IBM RS-6000 

Linkwinds: Distributed System Alan Jacobson, JPL 
for Visualizing and Analyzing 
Multivariate and Multidisciplinary Data 
SGI Crimson 



DataHub: Knowledge-based 
Assistance for Science 
Visualization and Analysis 
SGI Indigo 

August 12, 1992 

10:00 GRADS Grid Analysis and 
Display System 
DECstatlon 

Distributed Analysis and 
Visualization Systems for 
Model and Observational Data 
SGI Crimson 

Astrophysics demo 
DECstation 

DataHub: Knowledge-based 
Assistance for Science 
Visualization and Analysis 
SGI Indigo 

12:00 SAVS: Space Analysis and 
Visualization System 
DECstation 
IBM RS 6000 

Polypaint : Experimenter ' s Lab 
for Visualized Interactive 
Science 
SGI Indigo 

Land-Surface Testbed for EOSDIS Tim Kelly, U of Colorado 
DECstation 


3:00 Linkwlnds: Distributed System Alan Jacobson, JPL 
for Visualizing and Analyzing 
Multivariate and Multidisciplinary Data 
SGI Indigo 

VIS5D and VISAD William Hibbard, U of Wisconsi 

SGI Crimson 

Interactive Environment for Keith Searight, U of Illinois 
Analysis of Large Earth 
Observation and Model Data Sets 
IBM RS-6000 

Tool for Planetary Atmospheric Rich Keller, Ames Research 

Modeling 

SPARC st at on 


August 13, 1992 


Thomas Handley , JPL 


Brian Doty, U of Maryland 


Steve Koch, GSFC 
John Hagedorn, GSFC 
Matt Arrott, NCSA 


Alice Bertini, U of Colorado 


Thomas Handley , JPL 


Alan Mankofsky, SAIC 

Charles Goodrich, U of Maryland 


Bill Boyd, NCAR 

Allison Kipple, U of Colorado 


10:00 


Astrophysics demo 


Alice Bertini, U of Colorado 


DECstation 


12:00 


3:00 


Tool for Planetary Atmospheric Rich Keller, Ames Research 

Modeling 

SPARCstaton 

Interactive Environment for Keith Searight, U of Illinois 
Analysis of Large Earth 
Observation and Model Data Sets 
IBM RS-6000 

Polypaint : Experimenter ' s Lab 
for Visualized Interactive 
Science 
SGI Indigo 

VIS5D and VISAD 
SGI Crimson 

GRADS Grid Analysis and 
Display System 
DECstation 

Linkwinds : Distributed System 
for Visualizing and Analyzing 
Multivariate and Multidisciplinary Data 
SGI Indigo 

Land-Surface Testbed for EOSDIS Tim Kelly, U of Colorado 
DECstation 


Bill Boyd, NCAR 

Allison Kipple, U of Colorado 

William Hibbard, U of Wisconsin 
Brian Doty, U of Maryland 

Alan Jacobson, JPL 


SAVS : Space Analysis and 
Visualization System 
DECstation 
IBM RS 6000 

DataHub: Knowledge-based 
Assistance for Science 
Visualization and Analysis 
SGI Indigo 

Distributed Analysis and 
Visualization Systems for 
Model and Observational Data 
SGI Crimson 


Alan Mankofsky, SAIC 

Charles Goodrich, U of Maryland 


Thomas Handley , JPL 


Steve Koch, GSFC 
John Hagedorn, GSFC 
Matt Arrott. NCSA 
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APPENDIX E: 

Presentation Material 


The following material was presented or distributed at the meeting. 
Full size reproductions of this material are available from 
Mr. Glenn H. Mucklow, Code SMI, National Aeronautics and 
Space Administration, Washington, DC 20546. 
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Pipoaed Photo Pine 



Current Catalog 
Generation Scheme 


Lut of object* in 
footprint , 


Gei CoortJirutes ■ 
lor Object* 


„ CUufiftcibon of objects 
in photo fooqnn! 







AUTOFXATT summary: ?roc*#sing of a singl* plat* scan 




Comtmct th* uitii plat* catalog 




Load image blocks for th* current row of footprint* 




Mosaic image blocks to fon* a footprint image 




Assess footprint image quality 




Measure footprint sky and sky sigma 




Int itlallr* rOCAS footprint catalog 




D*t*ct objects in footprint 




Assess FOCAS sky estimate 




Mtiiura objects in footprint 




Construct r*-norraali**d para«*tars 




Classify sura-thing stars and construct PSF template 








Save footprint catalog in 5AS format 




Hatch footprint catalog with catalog to l*ft and 
bottom, checking for consistency 




Co to the nert footprint, if there is one 




Check the current row of footprint* for trends and 
outliers among key parameter* 




Save the currant row' s footprint catalogs in the 
master plate catalog 




Mosaic the current row' s sky and reduced images into 
th* master plat* images 




D*l*t* th* row and go to th* next, if th*r* is one 




Check column* of footprint* for trend* and outlier* 
among key parameters 
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SIGMA'S Intelligent Assistant Approach! I [ | Knowledge Base Overview 














E 24 






CRIGiNAL PAGE IS 
OF POOR QUALITY 



MIXTURE-REFRACTIVITY-CALC Equation 











































■S 3 

O (0 
U O) 

C '+Z 
® » tfl 

s > « 
gjzl 
3S.8 
8 | fr 

o'C g 
0 ) Q_ .2 

»ss 

E-c 


3 Q 

o || o 
CL ra >« 5 

E > 2 O 

■— Cl) Q. u 


(/> CT> re 

III 

X O > 

o> c m 

<D ^ CL 
</} 

3 


E 31 


Suggest protocol changes that would result in better 
utilization of remaining time 
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ORIGINAL PAGE IS 
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lication : Cataloguing sky objects from the 

2nd. Palomar Sky Survey (POSS-II). 
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PARALLEL ALGORITHMS FOR 
DATA COMPRESSION 


Dr. James A. Storer 
Brandeis University 


August 11, 1992 


A 


Parallel Algorithms for 
Data Compression 


James A. Storer 
Computer Science Dept. 
Brandeis University 


Joint Work With: 

B . Carpentieri (ph d student) 

M. Cohn (Faculty) 

C. Contantinescu (PhD student) 
S. De Agostino <ph d student) 

E. LlTl (Post-Doc) 

Q. Ye (Ph D Student) 

R. ZltoAVolf (Ph D Student) 


Outline 

• Introduction 

• Systolic Algorithms for Lossless Compression 

• Lossless Compression Hardware 

• NC (Poly-Log) Algorithms for Lossless Compression 

• Vector Quantization for Image Compression 

• On-Line Adaptive Vector Quantization 

• Image Compression Visualization Tools and Experi- 
ments 

• Image Compression Hardware 

9 Video Displacement Estimation 

• Real-Time Video Compression Hardware 

• Current Research 
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Applications of 
Data Compression 



sender 


decompress 


receiver) 


lossless = decompressed data 

is identical to the original 


lossy — decompressed data 

may be an approximation to the original 


J V 
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Key types of data: 

• text 

• computer source/object code 

• data bases 

• numerical data 

• speech 

• music 

• gray-scale images 

• color images 

• graphics 

• GAD data 


• animation 

• half-tone/fax data 

• finger print images 

• bank check images 

• map and terrain data 

• medical imagery 

• scientific and instrument data, space data 
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Examples of Speeds Required 
for Real Time Processing: 



Text sent over a modem ~~ 2,1,60 bits per second 
(Depending on the coat uf the modem, commonly used speeds 
range from 1.200 bits per second to 0.000 bits per second.) 

Speech ~ 100,000 bits per second 

(One government standard uses 8.000 samples per second, 12 
bits per sample.) 

Stereo Music ~ 15 million bits per second 

(A standard compact disc uses 44,100 samples per second, 16 

bits per sample, 2 channels ) 

Picture Phone 12 mdlwn bits per second 

(A low resolution black and white product might require 8 bits 

per pixel 250x256 pixels per frame, 24 frames per second.) 

Black& White Video 60 million bits per second 

(A medium resolution product might use 8 bits per pixel 512 
by 512 pixels per frame, 30 frames per second.) 


HDTV ~ / billion bits per second 

(A proposed standard has 24 bits per pixel, 1024 by 768 pixels 
per frame, 60 frames per second ) 


• image sequences 


• video 


V. 
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Lossless Data Compression with 
On-Line Dynamic Textual Substitution 


J 

"\ 


Idea: 

Maintain a dictionary of strings that have occurred frequently 
in the past and replace new occurrences of these strings by 
their index in the dictionary. 

In a “lock-step” fashion, the encoder and decoder are con- 
stantly changing their dictionaries to best reflect the data seen 
thus far. 


V 

— 

Data Compression with 
On-Line Dynamic Textual Substitution 

(1) Initialize the local dictionary D to have oik* entry 
fur each character of the input alphabet 

(2) repeat forever 

(a) {Get the current match string s } 

Use a match heuristic MH to read s from the input 
Transmit bits for the index of $ 

(b) {Update D :} 

Add each of the strings specified by an update heuristic UH to D 
{if D is full, use a deletion heuristic DH to make space) 
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Generic Encoding Algorithm 




( 1 ) Initialize D by performing Step 1 or the encoding algorithm 

( 2 ) repeat forever 

(a) {Get the current match string s:} 

Receive \logi\D\\ bits for the index of s 
Retrieve s from D and output the characters of 5 


(b) (Update D } 

Perform Step 2b of the encoding algorithm. 


J V 


Generic Decoding Algorithm 
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The Match Heuristic 



• Greedy: Read the longest match possible. 

• Lookahead : Employ a lookahead buffer to check if taking 
a shorter match now will pay off with better compression 
later. 

• Special Characters : Read the longest match that ends with 
a special character (which can be specified in advance or 
dynamically learned). 


Sliding Window Data Compression 




Idea: A form cn-lir.e textual substitution where the dictionary is just a 
window of t ho hist n characters and instead of pointers being simple in- 
dices. they are [ displacement .length) pairs that indicate a substring of the 
window The update and deletion heuristics are to just “slide” the window 
Any match heuristic that reads a string that is a substring of the window 
will do; usually the greedy heuristic is used (and is provably optimal when 
all pointer have the same size) 


Noter: 

• To insure that a match of at least one character can always be found, 
a pointer value is reserved for each character of the alphabet. 

• In practice, it pays to use “fancier” methods of encoding pointers be- 
cause the distribution of pointer values (particularly the length field) 
tends not to be uniform Also, better methods of coding pointers avoid 
the inefficiency of having to divide the pointer into two fields. 


Efficient Serial Implementation: 

• [f the window extends back to the beginning of the input string, a simple 
linear time implementation is to build a position tree as you go (using 
McCreight s Algorithm) and compute longest matches by walking down 
from the root to a leaf and then matching as much additional input as 
possible 

« When the window does not extend all the way to the beginning of the 
input string, the position tree data structure can be modified to allow 
deletion of strings, or three overlapping copies of the position tree can 
be employed 
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Update and Deletion Heuristics 
for Dynamic Dictionaries 


J 


\ 

f Systolic Pipes 


Update Heuristic (DH): 

• First Character (FC) Add the last match concatenated with the 
first character of the current match 

• Identity (ID); Add the last match concatenated with the current 
match 

• All prefixes (AP): Add the set of strings consisting of the last match 
concatenated with each of the prefixes of the current match 


Deletion Heuristic (DH): 

• Freeze when Pull (FREEZE); Once the dictionary becomes full it 
is “frozen” and remains the same from that point on. 

• Restart Periodically or when Compression Drops (RESTART) Pe- 
riodically or when compression drops, remove all elements of the 
dictionary (except the characters of the input alphabet). 

• Least Frequently Used (LFU); Delete the string that has been matched 
least frequently. 

• Least Recently Used (LRU): Delete the string that has been matched 
least recently. 

• Swap when Pull (SWAP); Keep two dictionaries. When the primary 
dictionary becomes full, start learning new entries in the auxiliary 
dictionary but continue compressing data with the primary dic- 
tionary. From this point on, each time the auxiliary dictionary 
becomes full, the roles of the primary and auxiliary dictionaries are 
reversed, and the secondary dictionary is reset to be empty. 


V 
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• All processors are identical and the length of 
connections between adjacent processors can 
be bounded by a constant. 


• The structure can be laid out in linear area and 

power and ground can be routed without crossing wires. 


• The layout strategy can be independent of the number 
of chips used. A larger pipe can be obtained by placing 
as many processors as possible on a chip and then, 
using the same layout strategy, placing as many chips as 
possible on a board. 

V 











Systolic Pipe for the 

Sliding Window Method 

("wrap " architecture) 


Systolic Pipe for the 

Dynamic Dictionary Method 

(Uses a variant of the ID update heuristic that 
forms matches in a "bottom-up" fashion.) 


• reset • fulIB 

• mode • hiUA 

• flush ♦ 

• leader • 

• xuve • dummy 

• stop 


Diagram of a Single Cell 

(4,096-256- 1)=3,839 cells form a complete array ) 


Board Level Design that Implements the 

Swap Deletion Heuristic 


Processor Array 


Hesses 


CONTROLLER 


■hum ' 
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Real-Time Adaptive Lossless 
Compression Hardware 
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custom CMOS chip 


\ 


compression / decompression board 


VME Board: 

• 20 mhz clock 

• 30 custom chips (l.Ou double-metal CMOS) 

• 160 million bits per second throughput 
(one byte is processed on each clock cycle) 


HiPPI Board: 

• 100 mhz clock 

• 15 custom chips ( 3u double-metal CMOS) 

• S00 million bits per second throughput. 

























Greedy Versus Optimal Parsing 
for Text Compression 

Idea: Textual substitution algorithms typically employ greedy parsing-, that 
is, at each stage, the longest possible string is taken as the next match 
However, optimal parsing strategics-may sometimes take a shorter match 
so that bigger savings can be achieved later 


Example: 

input string: baba k 
dictionary: a, 6, (6a‘ : 1 < i < /;}. bab 
optimal parsing: 6a, 6a fc 
greedy parsing: 6a6, a, a, ... a 


Theorem: The intersection of the i th phrases of ail optimal parsings is 
non empty, and hence there is a set of canonical substrings that identify the 
positions of optimal phrases 


*? — r 


*** This is why optimal parsing can be done on-line and why parallel algo- 
rithms for greedy parsing can be generalized to optimal parsing 
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Key Results 

for Fixed-Sized (Trained) VQ 


On-Line Adaptive VQ 


A complexity analysis of codebook design and search 
shows thac optimal VQ as well as optimal k-means design is 
NP-hard. 

A complexity analysis of tree-structured VQ shows that 
finding optimal pruned trees subject to the leaf entropy or ex- 
pected depth is NP-hard, whereas a polynomial-time algorithm 
is presented when the cost function is the number of leaves or 
maximum depth. 

tree growing algorithm constructs trees that are 
well balanced among different costs; experimental results show 
it to achieve the compression performance of optimal pruning 
without the computational overhead. 

Worst- case bounds on the performance of tree search ver- 
sus optimal full search have been derived; efficient heuristics 
have been developed to significantly improve tree search per- 
formance. 

Massively parallel algorithms have been developed to im- 
plement existing VQ algorithms as well as novel ones for design 
and codebook search. 



Experiments 


Test images: 

BrainMR: 25Gx25G, 8 bits/pixel brain image (mr) 
BrainCAT: 512x512, 8 bits/pixei bram image (cat-scan) 
DonaldsonVilleL A4: 512x512 NASA Satelite Image, band A 
Donaldson VilleL AG: 512x512 NASA Satelite Image, band 6 
WomanHat: 512x512, 8 bits/pixel grayscale photo 
LivingRoom: 512x512, 8 bits/pixel grayscale photo 
FingerPrint: 763x7G8, 8 bits/pixel FBI fingerprint image 


Visualization Tools: 

• original image 

• compressed image 

• movie of adaptive growing process 

• “checkerboard’ 1 display that maps the rectangles used 

• median intensity display 

• dictionary display 


Results: Signal to noise ratios for a given compression ratio typically equal 
or better traditional fixed-size trained VQ As can be seen from the slides 
of the decompressed data, resolution of edges is especially accurate (good 
for scientific and medical data) 

NOTE: This is a huge success' The same adaptive method, with no prior 
knowledge of the data, can be used for diverse data sets while achieving the 
performance of trained methods 
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Displacement Estimation 


Idea : Approximate interframe motion by 

piecewise translation of blocks of pixels. 


Fixed Block Method (Jain and Jain [81]) 



(Rotation, zooming, etc., approximated by 
block translation, if blocks are small.) 


U = M x N, U = (M+2p) x (N+2p) 


Mean distortion function between U and IL 

J U K ^ 

D(i,j) =,7:. I Is(u(m.n)-ujm+i,n+j)), -p<ij<p 


Note : Displacement estimation is a crucial part 
of the MPEG standard . 


DMD: direction of minimum distortion, for each block 


Finding DMD for a block = evaluation of D(i,j) in 
(2p+1) x (2p+1) positions 


Hypothesis: data is such that D(i,j) monotonically 
increases as we move away from the DMD 


Limitations of Traditional Displacement Estimation 


The Model of Computation 


• Monotonicity assumption. 


• Fixed size blocks. 



DISPLACEMENT ESTIMATION ENCODER 


• Serial model of computation. 


• Input/Output is serial. 

• n : number of pixels per frame. 

• Each processor corresponds to a block of k pixels 
(i.e. n/k = Nx N). 

• Controller communicates with only one processor. 

• Data fpr the current and previous frame is 
processed by the grid while data for the next 
frame is filling up the frame buffer 


N Mil I 





Superblocks 


Def. : Superblock at time t : set of adjacent 
blocks with the same DMD at time t-1 


Properties of Superblocks: 

• Superblocks will represent areas of the image 
with the same displacement vector 

• Superblocks may have no prescribed shape 

• Superblocks may grow and shrink from 
frame to frame 


' Encoder's Algorithm at time t 

Phase 1 - all processors in parallel: 

construct search area 
compute DMD 

Phase 2 - controller : 

get DMD from each processor 
compute superblock splits 
output a list of all the splits 
output DMD of each superblock 


Idea : Use a parallel grid architecture to segment 
each frame into superblocks 


Note : We will not need the monotonicity assumption 


Phase 3 - controller : 

construct new superblocks 


Encoding Splits and Displacements 




Communication Between Controller and Processors 


Splits : Send ID of the superblocks that need to 
be disoved. 

Send list-of-splits. 


Displacements : Send one DMD for each 

superblock that has not been 
disolved and the DMD of the 
blocks for the superblocks that 
have been disolved. 


Threshold Condition : 

If size_of_data > threshold , controller disolves 
all the superblocks. 


CONTROLLER 

A 



The communication between processors and 
controller is pipelined 


Note: Due to the threshold condition, never 

worse in terms of data sent than the fixed 
block approach. 



V 
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Analysis of the Algorithm 


Implementing the Algorithm on a Pipe 


Time : O(n) (on-line algorithm). 


Space : 0(1) for processor 

O(NxN) for controller. 


Controller 



Fidelity: no worse than fixed block method 

(possibly better: no monotonicity assumption). 


Amount of data sent: 

no more than the fixed block algorithm 
(has the potential to represent "easy parts 
of a frame, e.g. background areas with 
single superblocks). 


Current Research 

Lossless Compression: 

• Polynomial-Time Algorithms for dynamic off-Line encoding, that are 
provaldy better than greedy, and are on-line decodable in sub- linear 
time. 

• Parallel approximation algorithms for dynamic dictionary 

• Sub- linear Implementations on existing massively parallel machines (CM5 
MASSPAR, etc ) 

• A 2 5 billion bits per second systolic SONET board. 

Adaptive Image Compression: 

• Provable performance bounds {for both lossless and lossy). 

• A better understanding of how quality is learned 

• New learning strategies 

• Implementations on existing massively parallel machines 

• Practical algorithms for hardware implementations. 

• Visualization tools - ones that go beyond the tools we have developed 
thus far and and allow us to better understand ‘‘what Is happening'’ 
to the data when it is compressed and to gain insights on improved 
growing / update strategies as well as distortion measures 


Output at time t-l 


Video Compression: 

• Poly- Log displacement estimation 

• Provable performance bounds. 

• Further study of the complexity of various optimization problems aris- 
ing in displacement estimation 

■ 'mplementations on existing massively parallel machines 

• Practical algorithms for hardware implementations 

• Visualization tools. 
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PERFORMANCE AND SCALABILITY 
OF CLIENT-SERVER DATA BASE 
ARCHITECTURES 


Dr. Alex Delis 
University of Maryland 


August 11, 1992 



Today’s Reality 


Today’s Needs 


» Autonomous database (lots of thorn) 


• Infer database querying. 


• Overlapping and Inconsistent. 


Download and Downsize. 


• Accessible through networks but not interoperable. 


Inter dat abase dependency tracking and change propagation 


• Multiple sites, many server machines, thousands of workstations. 


Version and Change Control 


• Architectures: centralized or Client Server. 


• Gate wav access: one database at a time. 


• Distributed heterogeneous DBMS? 


Architectures (distributed, pipelined, parallel] 


• Interoperability of heterogeneous relational DBMSs 


• Multisite transaction management. 






The Multidatabase Approach 


• Databases an 1 autonomous developed. managed and evolve in 
dependent!}, 

• There is N< ) global sehema, 

• Each database has an export schema for allowing remote access 


• Private schema is for hiding portions of the database from re 


• No interferem e from remote transac tions and no effec t in the local 
applications. 

t Inconsistency amongst multiple databases is an accepted fact of 


Technology Trends 










^ ADMS±: Enhanced Client-Server ^ 
Database Architecture with 
Incremental Gateways to 
Heterogeneous Relational DBMSs 


Site I Sue 2 j 



Workstations 


• Even- \VS runs locally an ADMS- version of the ADMS. 

• Every \VS offers a serious disk rapacity 

• The configuration follows the general Client-Server Model. 


f Main Features of the ADMS± System"') 


EXPLOIT DATA CACHING: 

• Even- \V5 User may query seiwr DBMS and choose to cache the 
data. 

• There is gain when similar or icla’ed requests are being asked. 

DIFFICULT P( >INT m CACHING: 

• After a Server 1 pdate occurs, il may need to be propagate* 1 properlv 
to all the pertinent clients 

How are l pdates of Cached Data being carried out? 

O Update Logs [Differential Files] 
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figure 1: Lug Overview 
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\ Main Features of the ADMS± System 
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Global and Local Access Path 
Distribution 


J 

"A 


• Every time a server relation is being queried then a bound between 
thr server and the particular site is created (indicating parent 
child association). 

• Updated portions of relations art' Titnestamped at the Server 
site. 


• Hybrid Access Paths: global and local. 

• Global paths are serialized by the concurrency control module 

• Local paths run in parallel with other loc al paths. 


• Every ADMS- keeps track of the last seen update timestamp of a 
server relation. 


• From those relations having larger timestamps send to the ap- 
propriate ADMS- only the pertinent pieces of the Log ==> small 
increments are sent over the network. 


• Queries and Updates are managed at the Server site utilizing a 
2 <j> Concurrency Control Protocol. 
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Combined Query Throughput 


Combined Query Throughput 
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Capabilities 


Comparison of DB Server N 

Architectures 


• Inter database queries ana mixed orwi views. 

• Dynamic migration of data from the servers to the workstations. 

• Incremental maintenance of downloaded data. 

• Lazy no- broadcast ins update strategies. 

• Database servers are autonomous. 

• Each workstation provides the glue for making the server 
database interoperable. 

• Location transparency (but no database transparency). 

• User perceives this as a really integrated distributed environment 
hut does "feel" the distribution. 



Characteristics 


Simulation Results 


Standard Client-Server 
O No distribution of data 

• Overload 

• Single Site 

Diskless Client-Server 
[Rubenstein et al. SIGMOD 87] 

O Distribution of processing (better) but still 

• no site autonomy 

• lots of net traffic 

• concurrency overhead to all queries 

ADMSi Enhanced Client-Server 

(Roussopoulos et al. Computer 86] 

O Distribution of both processing and data 

• site autonomy (except for updates) 

• minimal net traffic 

• minimal overhead 


Parameters 

• 1 Server n workstation clusters 

• Celery streams: three levels; light, medium, heavy I/O 

• CPU I/O. Network Rate etc. 

Metrics 

• Query throughput and speedup 

• Server I/O reduction 













Simulation Results 


Simulation Results 


CS and RU Throughput Rates (SQS-U) 


Throughput 

ECS Throughput (SOS U) 
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Summary 


Future Work on the ADMS± 
Architecture 


• Extensible, scalable and inexpensive architect tire. 

• Parallel access to multiple local copies. 

• Incremental access methods for reducing data transfer and main 
tenanee of downloaded data. 

• Preserves database server autonomy. 

• Glues multiple commercial DBMSs in a workstation environment 


• Gateway Query Optimization. 

• Pipeline Algorithms for interdatabase queries (N way joins). 

• Adaptive update propagation strategics. 

• Multi site transaction management and recovery in autonomous 
databases. 

• Experiment with increment updates of mirrored databases. 

• Applicability of the same techniques iu multi processor environ- 
ment with or without shared memory. 










A LAND-SURFACE TESTBED 
FOR THE EOS DATA INFORMATION 
SYSTEM (EOSDIS) 


Dr. William Emery 

Colorado Center for Astrodynamlcs Research 


August 11, 1992 
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radiometer (AVIlRRr satellite images. imago viewing. »>n line browse 
ami manipulating software via ihe Inlet nei The solrvvnre. Mntifsho 
i |*i >|- \ T NIX machines 1 anti (magic (for the Macintosh) were developed 
ai ihe Colorado (Vntet lot Asirodvuamic Keseaieh (WAR). These 
Jjsplax programs allow a user to look at an image, maiiipulate the 
histngiam. emp. enliaitee via seven nl mmies. color, oveilay. and 
animate the images The major service of the test bed is hi provide 
digital AVHRR data ol the western l 1 S In inlet ested users connected 
to the Internet. The lest lied system allows even the novice comptih 
nset In interface with the hrmvse and ordering systems. The 
snltwaic i>> C r and Motif X Windows piogramming. which allows user 
to easily mteiact with the testhed through the scieen commands. 


The law satellite data and AV1IRR images are simed on the National 
tVniei foi Ahnospheiic Reseat eh iNCAR j Mass Storage Tape Drive 
System i MSS i AM ol the browse images are stored on the gateway 
xvoi kslaiion. Due to response time needs and processing constraints 
die test bed utilises several programs to make the system 
snecesstidlx order ami deli'ei I he data 1 he il.it a tiles that are 
stored on the MSS are accessed through the N( AR s ( ray ^ MP super 
computer which retrieves lire AVIIKR images and sends the images 
Iti the ealewav wmkslafioii 


uses an epnemeris-oriven orbital model to hdiow the satellite and 
'.tail Ihe telemetry ingest piocess I he lelcMteiiy sne.irn harismiti 
hour the salellile is leceived and lecoided In the Ct’AR ground 
station (VAR stores 1 to I satellite passes pci da\ . sexen dav* a 
week, each pass consists ol I hi mcgahi les ol data The data is 
caliluated. ihnnncN sep.n.ned, geo regisleied. navigated and 
loimated lire second step ol ihe testbed sxstem is to semi all ol th 
taw and AYHRR images to the MSS Oiue all of the data has been 
placed in the stuiage din\ n»i res on the MSS. ihe gateway compmei 


The gateway computer polls tin Mss eveiyday to updaie the catalog 
ol Jill ihe A\ HRK images and law data Phis catalog is then uiodilicd 


id Jill the WIIRR images and law data Phis catalog is ilu'ir uiodilicd 
to tmeiact w ill i the »«IIk*i oidemrg and browse piogt.mis Since the 
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an entry prompt to break the secure environment of the window 


The Inst step in the te'dK’d sxstem is lor ('CAR to li.ak down 
link i Lit. i from the Nf i A A weathei saielliles lire Mat king sVsfem 


ORIGJbiAL PAOt IS 

OF POOR QUALITY 






r 


a r 


ii si' i s in si U i, I .iti im;ij:r 1 1 on i Th*- v mu-iil browse file lisi tniikl (Ik* 
image. nek-el Mu* viewing si/e. map Mu' image. overlay l lie map nn 
i he i m ape. am! semi the image mci Mu- [Metnci liont ilu- lutsi 
computer. 

Tlu* browse image is a sample of the lull image reduced hy a taeior 

ol I in hoili Mte X ami V diim’iisuins T he browse image is only useful 

in seeing (fie curreiii weather over a speeifie region ami area 
coverage limn the satellite pass, while keeping the image small 
enough to he viewed over the liiicinci 

The order program works Mic same as Hiovvse in that the user is 
given a window that interacts with a ealaing file *>T images The user 
has the option to outer one .or several images ami have them 
leimned to ihc lip anonymous directory under their name Order 
intei. .its with the MSS it places the order, hacks the process 
number for the oulei. hacks Mu:, image older nuinher. parse the 
mail from the MSS. and combines all ol this to give the image its 
original name and order numhei Oulei then lies the image lo the 
user who ordeicd ii. builds .1 file under the User's name in the lip 
ditechuv, and places the image in that diicctoi) This process takes 
5 minutes to lompleic. due inainlv to the access time lm ihe MSS 
The euricnt coverage area o! .1 hill AYHKK image consists ot all of the 
west coast out to the gtat plains using all five channels The images 
aie 2V>0<\1 \ IS.kk y, in pixel si/e 
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LIST OF PROMINENT USERS 
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UNITED STATES EDUCATIONAL FACILITIES 

Purdue University 
Northern Arizona University 
University of New Mexico 
Berkeley Universily 
Tulane Universily 
University ol Utah 
Penn State University 
University ol Arizona 
University of Nebraska 
University of Montana 
Cornell Universily 
Brigham Young University 
Columbia 

Colorado State University 

University of Virginia 

University of Cafilorma. Santa Barbara 

University of California. Davis 

University of Texas 

Universily of Oklahoma Storm Lab 

Massachusetts Institute of Technology 


UNITED STATES COMPANIES 

Lockhead Corporation 
McDonald Douglas corporation 
Oxidenlal Corporation 
MEC Analytical Systems 
Lighting Sciences Inc 
IBM 

Hewlelt Packard 
Seamans 


UNITED STATES GOVERNMENT AGENCIES 

United Stales Navy 
United States Air Force 
NOAA 

Pacific North Labs 

U S Department of Agriculture 

NASA 

NASA JPL 

U.S.G 5 

UCAR 

SCRIPS 

Woods Hole 


FOREIGN EDUCATIONAL FACILITIES 

University of Ottawa - Canada 
University of Toronto Canada 
OsloirJ Regional College Norway 


FOREIGN GOVERNMENT AGENCIES 

Department ol Fisheries and Oceans -- Canada 
United Kingdom Aero division 
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.ogins 


BREAK DOWN OF TEST BKI> GROWTH 


C) r cl e r s 

Ordering has grown from zero order to as much a* 400 orders per 
month. Currently. tiic lest bed is going thmugh transition to 
navorder And the drop off can he traced to users waiting until 
na voider is on fine ami replaces order 


November I 99 i I 

Decent be i 1991 26 

January 1992 yn 

February l‘W2 I S 5 

March 1992 4 2 0 

April 1992 3 79 

May 1692 4 00 

June 1992 3 02 

July . 1992 179 

Total Orders 2.0S2 


U s e r s 

Users are broken down into the following categories: 

U.S. Education. U,S Companies. U.S. Government. Foreign Education. 
Foreign Companies. Foreign Government, and No I.isting. 


U.S. Education 309 

U.S. Companies SO 

U.S. Government 2 63 

Foreign Education 15 

Foreign Companies 4 

Foreign Government 4 

No_! jsii ng 59 


Total Number of logins 7 3 4 


Graphs are provided for the data above. 


CHART OF ORDER PROGRAM USERS 



U.S.EDU 
U.S. COMP 
U.S. GOV 
Foreign.EDU 
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Non Listing 


Total Agencies Using Order - Images Only = 734 
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Logins Through Order System — Images 
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Logins for Order System - Images Only 
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Megabytes Of Data 


Data Files Transferred Via FTP 
Order System « Images Only 



Nov91 D*cB1 J«n92 F«b92 Mar92 Apr 92 May 92 Jun92 Jul92 

Months Of Operation -- Order System 





ORDER SYSTEM IMAGES ONLY 

Examples Of Area Coverage And Ordering Window 


Example One: 

Image Catalog Window 

Example Two: 

First Area Coverage Browse 

Example Three: 

Latest Area Coverage Browse 


1 fMW? Images Available 


I /KVaiSAT/WEST3?^40US . J 2-2 

■ new 

1 ✓DSS/'EU5fiI/HESTS204i?5B 1 12-3 

.new J 

1 /DSVCUSAT/KE5I9ZO4090.12-4 

.new 

1 /035/CU9AT/WE?T32040«?B t i2-5 

,TH?W I 

I /USS/CUSHT / WS?92fl#ffiC ,11 1 

.new 1 

I ,'85VCU5RT/HES?920405C.n-2, 

.new 

I /K>yCUSMT/WEST9204^C,ll-3, 

.new j 

I 2D5S/CUSAT/WEST920405C .11-4, 

.new 1 

/DSS/CC5AT7HEST920405C ,11-5, 

.new I 

/B3S/CU3AT/WEST920405D .11-1. 

.new | 

7B5S/CU5AT/MEST9204fj5D ,11-2, 

new 1 

/BS5/aJSST,'NtS19204Q5B, 11 *3 , 

.new | 

/PSS/CUSflT /WFST320405D . 11-4. 

new I 

7D55/CUSAT /WES1 92A405B .11-5. 

new 1 

/OSS/CU9ftT/HEST9204O6B,12-l , 

new 1 

Place Order 

Cancel Dialog 1 






I Enter File Hi»e: 

F it. itude ■ eutei Puint 

Na^igat ion by CEmR 


[ l nnqi t tide tentei F'r.' 1 r |t 



fvjIKK 1 { 1 1 pegtee:; 

Umwel: 

Lhaunel 1 
Ch am ip 1 


Re: o hit ion Value: 

t hawtpj^a 
•.hanfiel 4 
* hannel 
1 im 


Fmt cited Iwage m*»»; 

■ L** 


Pnue* ti*<u Type: 

['■‘lit*. 


Option:: 

□•..•ei lay Map 
r*mtli Hngle: 


Place Order 

E le^at ton Map M, c , 
8 Pit Image 
Browse Return 

C l P.3» ^f^Gf 1 
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Temporal Variation of L-Band HH 
vs. Crop Type 
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ENVISION: AN ANALYSIS AND DISPLAY SYSTEM 
FOR LARGE GEOPHYSICAL DATA SETS 


Dr. Kenneth P. Bowman 
Texas A & M University 


August 11, 1992 


^jENVJSJON 


An Analysis and Displav System 
for Large Geophysical Data Sets 


Principle Investigators 

Kenneth P. Bowman 

Climate Systems Research Program 
Department of Meteorology 
Texas A&.M Universirv 


John E. Walsh 

Department of Atmospheric Sciences 
Universiry of Illinois at Lrbana -Champaign 


Robert B. Wilhelmson 

• aiionai Center for Supercomputinc Applications 
Department of Atmospheric Sciences 
University of Illinois ai Lrbana-Champaign 


Keith R. Semright 
David P. W'ojiowici 
Depanmem of Armosphenc Sciences 
University of Illinois at l rbana-Champaign 


• a metadata browser and editor 

• a data management system 

• a set oflinks to feed data to existing 
visualization tools 

• a set of custom designed visualization, 
analysis and data manipulation tools 


• a database aystem, relational or otherwise 

• a new file format 

• a toolkit or library 

• a specific visualization tool 



ENVISION 
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The usual way to see what’s inside a netCDF file; 


DATA FILES 


USER INTERFACE 


DATA MANIPULATION TOOLS 


E3ENVJSION- 



The Envision Table Display /User Interface 

Dimensions . 


Pnfmct -znk^vbw -gy*^&viu*8&¥/ . fp EN VISION 


' variables Variable defined Variable noi defined 
in this dimension in this dimension 




Typical File with only the minimum required metadata: 


no urw* io*ca!wd 




fPEWVJSJON 


^ENVISION 














It is not necessary to write these changes to the file. 
This is desirable because: 

• the file may not be writeable 
(CDROM, user doesn’t own file, etc.) 

• netCDF library rewrites entire file if 
metadata is expanded 

• user may not want changes to be 
permanent 


Data is stored externally by Envision. 


However, changes to metadata may be incorporated into 
actual data files at any time user desires. 


□Ettvisforf- 


Often however, a dataset consists of more than one file. 



This makes it difficult to deal with the dataset as a whole 

* files must be processed individually 

• the user must keep track of which file the 
desired data is located in 
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Summary of data management capabilities: 

• manages relationship! between hies 

• provides transparent access as a tingle 
entity to a dataset consisting of multiple 
files 

• delivers arbitrary 1,2, 3D ’slabs" of data 
from a dataset to visualization, analysis 
and manipulation tools 

• special emphasis on time problems, etc. 
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1 . ZirribODDCTZOM 

The objective Of this proposal is to integrate a real time 
satellite acquisition and data analysis system with environmental 
algorithms and models to facilitate data acquisition for 
environmental monitoring and management. Both the satellite 
system and environmental models are in existence. We will 
develop an integrated computer environment for these packages, 
and make this integrated system transportable to most UNIX 
platforms; and make the system available to the science and 
application community to conduct global change impact studies. 

Host software systems for analysis of data and images from 
various satellites exist independently of each other; and not 
much effort has been devoted to integrating them into uniform 
systems. Such lack of uniformity poses a barrier to users who 
often have to search for ad hoc hardware and software in 
analyzing environmental information from a multitude of 
satell ites. 

We will develop an Environmental impact Assessment and Modeling 
System (EIAMS) as a framework for data gathering, analysis and 
distribution to facilitate real-time environmental monitoring 
and hydrological modeling. Such a system is required by earth 
resource scientists and managers to conduct regional climate 
analysis and impact analysis studies. 

Currently, components of EIAMS exist. These Components include a 
set of tools which can be used to analyze data and images from 
multiple satellites (or instruments! DMSP, TIROS -AVHRR, GOES, 
METEOSAT, CMS and IHSAT (including SSM/I and TOVS) . A package 
for analyzing hydro-meteorological data, GEMPAX, has also been 
integrated under this software system. Potentially this system 
can accommodate data from other instruments and satellites. 
Funded under this proposal, the next stage will be to integrate 
satellite rain-rate retrieval algorithms, hydrologic and 
agriculture models, and a geographic information system (CIS), 
all under one computing environment. The CIS and the hydrologic 
models are provided by the U.S. Army Corps of Engineers (COE ) , 
and the agricultural models by Houston Advanced Research Center 
(HARC) . 

The uniqueness of the proposed EIAMS is that it integrates 
several existing regional data basas, environmental monitoring 
systems and models into one computing environment. Numerous 
public domain software systems will be integrated and form the 
core of the system. 

Large geographic data bases exist and will be extended by planned 
satellite monitoring missions such as Tropical Rainfall 
Measurement Mission (TRMM) , and direct broadcast data from 
SEAW1FS and the Earth Observing System (EOS). EIAMS will provide 
a framework to integrate and facilitate utilization of the new 
data sets from these future NASA satellite missions. 


The primary focus of this project would be to integrate the above 
s *t of analysis tools and models under one system, and to provide 
a graphical user interface. The system will be data-driven 
instead of image-driven. Hence it will enable the scientists to 
perform scientific analysis as well as image processing. 

To test this system, several closely related applications would 
be proposed. For example, one of them is the analysis of SSK/I 
and GMS data to derive a rain-rate diurnal cycle over the Western 
Pacific ocean to support TOCA-COARE and over Thailand to support 
TRMM and Royal Rainmaking Research Institute. The Principal 
Investigators of this proposal have assisted the USAF in 
developing a real-time satellite data ingest and analysis system 
at the Joint Typhoon Warning Center in Guam. This satellite 
ground system can receive GH5 , TIROS and DMSP. The proposed 
software will be tested with data from this satellite around 
system. 

Further system tests will be conducted to evaluate the system as 
regards meeting the users requirements of both applications and 
research scientists. These system evaluation tests will be 
conducted to support co-investigators rro* U.S. Army corp of 
Engineers and Federal Emergency Management Agency, the Bureau of 
Reclamation, Royal Rainmaking Research and Development Institute 
of Thailand and the Texas Agricultural Experiment Station. 

In summary, the critical development areas of this system are: 

1) The system will have data input from a number of sources. 

Hence a data management software is needed for reformatting 
data inputs and outputs and data transactions between 
processes. 

2 ) The system will have a set of basic analysis tools such as 
image analysis and statistical analysis, The system will also 
possess sufficient flexibility to allow the user to formulate 
their own applications. Hence the application part of the 
system is a shell consisting of a number of encapsulated 
application modulas. 

3) The integration of hydrologic and agricultural models and a 
GIS will be the primary activity of this project. 

4) The user will interface with a number of data sources. Thus 
an innovative multi-task user interface based on graphics and 
visualization is needed. 

The proposed project design utilizes public domain software 
systems which would run with a multituda of software/hardware 
systems. It also utilizes some of the most comprehensive data 
bases available to the environmental scientific and commercial 
community. This, therefore, makes such a system a likely 
candidate for commercialization. 
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NASA will team up with the U.S. Army Corps or Engineers and The 
Houston Advanced Research Center to integrate the models into 
EIAM5 . NASA and HARC scientists have had a productive research 
association for the past several years. 

In the following. Section 2 will discuss the design strategy of 
EIAMS; Section 3 will detail the critical development areas; and 
Section 4 will discuss the applications for flANS. Sections S 
through 9 will cover the implementation plan, management 
approach, personnel, budget and available facilities. 

2 . SYSTEM DESCRIPTION 

It is the intent of the proposed project to show the feasibility 
of using satellite data and associated spatial modeling and 
analysis system as a unified tool to assist scientists in 
forecasting potential draughts, impacts on agricultural and 
ecological systems, and to eventually make recommendations for 
potentially impacted regions. This requires complex analysis of 
geographic data which need to be organized to support rapid and 
effective simultaneous usage. 

we have considered integrating a number public domain software 
systems. For example, GSFC has a number of tools for satellite 
data ingest and analysis; COE has a GIS and hydrologic models; 
HARC is providing the agricultural models, and image processing 
will be done with JChoros, a University of New Hexico public 
domain image processing package. 

The core software packages to be integrated into EIAMS are; 1) 
software for satellite and environmental data acquisition, 2) 
software for analyzing and displaying satellite and in situ data 
(raster and vector on the same display), 3) hydrologic and 
agriculture modele, 4) data management software, 5) geographic 
data bases and software to integrate them, and 6) a smart user 
interface. Software systems for 1, 2 and 3 are either existing 
or are under development at the International Data Systems 
Office, GSFC or HARC. Hence the critical development areas are 
in 4 , 5 and 6. 

EIAMS will be user friendly and flexible to include algorithms 
and models of specific concern to users (e.g. hydrologic models) . 
It can be configured to run on a single machine or on a 
distributed network system wherein software modules will be 
running in different computers and users will have workstations 
with access to the network. 

The modules (or subsystems) in EIAMS and their connectivity can 
be illustrated in Figure 1. With this configuration, EIAMS 
recsives data from the Global Telecommunication Service and any 
local environmental and meteorological reports. Its own 
satellite acquisition equipment is able to receive TIROS and any 
geostationary satellite (GOES, CHS, METEOSAT and INSAT) . The 
integrated module for satellite and environmental data display 


can handle both raster and vector graphics at the same time and 
is also connected to the geographic information module. This 
analysis and display subsystem includes scientific algorithms 
such as remote sensing retrievals of atmospheric temperature and 
water vapor soundings, sea surface temperature, rainfall rate, 
vegetation index and other geophysical parameters. Initially, 
the environmental models that are part of EIAMS includes a 
hydrological model for flood prediction and agricultural models 
for environmental assessment. 

The data flow among the collection of software subsystems is 
managed by an Innovative data management software, which employs 
a archive manager and reformat compiler to incorporate 
flexibility into EIAMS. Such flexibility allows the adding or 
upgrading of geographic information and environmental models. 
Furthermore, the user will be interacting with a number of 
software modules. They are, for example, the environmental 
models, image and data analysis software and a geographic 
information system. Hence the user interface is designed to 
allow the user to interact with several processes at the same 
time . 


3. CRITICAL DEVELOPMENT AREAS 

A critical component of future Earth science studies wilt be the 
development of an interactive spatial information and analysis 
system. A scientist or a manager/decision maker should not have 
to learn all of the necessary technologies to process data in 
order to make informed decisions. No single system is fully 
capable of performing the wide variety of GIS and spatial 
processing tasks required for a quick response to environmental 
events. Therefore, software integration issues are of paramount 
importance in the selection of components and the design of the 
overall system. The system should gracefully handle combinations 
of raster, vector and tabular data while maintaining proper 
orientation in both space and time. 

The wide variety of data sources and software package to be 
potentially integrated demands careful attention to initial 
system design, data reformatting programs, and data set format 
tracking. A flexible data interface to the spatial modeling 
software will feed the GIS data into the models and feed the 
model output back into the GIS for generating output products. 
All data sets must be properly tagged to track time sequences, 
assessments of the coupling of various parameters, and to compare 
model output to reality and to those from different or improved 
models . 

Four development areas are considered critical. They are: 1) the 
development of a data management software, 2) the Integration of 
existing analysis tools, and 3) the integration of models and 
GIS, and 4) the development of a graphical user-interface These 
critical development areas will be discussed in the following 
subsections . 


3.1 DATA MANAGEMENT 


Several software packages may be relevant in the analysis of an 
environmental event, each containing features beat suited to 
certain parts of the analysis and each requiring different data 
and command input. A complete analysis cf the environmental 
event may include communication between processes, for example, 
in feeding one application package's output to another package as 
input or combining the results of several models to make 
predictions. Hence we have to develop an Archive Manager and a 
Reformat Compiler. 

An Archive Manager will serve as a librarian, knowledgeable as to 
the whereabouts and format of all relevant data. The data can 
encompass a wide range of media (disk drives, magnetic tape, 
optical disk, etc.) and cover a wide geographic area, and have 
varied formats. The Archive Manager will be capable of 
determining what data is available in response to a staple user 
query specific to the type of data being searched for. For 
example, a search could be made for the satellite images covering 
a particular region, or the rainfall rates during a particular 
time in a particular region. Thus, queries can be in regard to 
temporal and spatial as well as to data specific attributes such 
as pixel resolution. 

A potential role of the Archive Manager would be a front end to a 
relational data base management system (RDBMS) . It would 
communicate between the user and the RDBMS helping to navigate 
the numerous catalog relations, translating spatial or temporal 
queries into RDBMS syntax and praserving query contexts and 
accumulated query results. The Archive Manager would accept 
varied user query, such as geogtaphic place naaee instead of 
geographic coordinate, and translate units such as "feet" or 
"meters" into appropriate model coordinate value. 

A Reformat Compiler produces data in a format suitable for a 
particular model or analysis module. Responding to a command to 
transfer data from one package to another, the Reformat Compiler 
will read a file describing the input format of the data and the 
desired output format. For many instances, pre-existing data 
descriptions (known to the Archive Manager) associated with the 
data sets, and output data formats and associated with the 
application packages could be used as automatic input to the 
Reformat Compiler so that data translation could be performed on 
the fly. The operation of the Reformat Compiler can be 
summarized by Figure 2 . 

A comprehensive Reformat Compiler (RC) in needed to reformat and 
input thm numerous varied data sets. Users will be able to add 
applications to ths system. Standard system integration tools 
such as X-vindows, Network File Systems (NFS), and UNIX operating 
system will be implemented as appropriate. We envision 
developing a "seamless" software system where various data sets 
from numerous satellites are being analyzed under one system. 
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THE GRID ANALYSIS AND DISPLAY SYSTEM 
(GrADS) 


Dr. James L. Kinter III 
Center for Ocean-Land-Atmosphere 
(COLA) Interactions 
University of Maryland 


August 12, 1992 


THF GRID ANALYSIS AND DISPLAY SYSTEM 
1 GrADS): 

A PRACTICAL TOOL FOR 
EARTH SCIENCE VISUALIZATION 


BRIAN E. DOTY 
JAMES L. KINTER III 


CENTER FOR OCEAN-LAND- ATMOSPHERE INTERACTIONS 
DEPARTMENT OF METEOROLOGY 
UNIVERSITY OF MARYLAND 
COLLEGE PARK, MD 20742 
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CENTER FOR OCEAN-LAND-ATMOSPHERE INTERACTIONS 


( \ 
CENTER FOR OCE sVLAND- ATMOSPHERE INTERACTIONS 

Basic Research in: 


• Atmospheric General Circulation Model (AGCMi 



• R40LI8 ^solution with full phvsics and dijrnal cvde 

• Climate Modeling 


• Simple Biosphere Model 

Monthly to Seasonal Predictability 


Each seasonal simulation 

Interannual Variability 


- requires 25 hours Cray Y-MP time 

Deforestation 


- requires up to 64 MB memory on one processor 

Desertification 


- generates 2.7 GB data 

Satellite Sensor Accuracy Requirements 





• Oceanic General Ciiculation Model (OGCMj 

• Data Assimilation 



Reanalysis for Climate Diagnostics/Monitoring 


■ Based on GFDL Modular Ocean Model 



- Each seasonal global simulation 

* Numerical Weather Prediction 


- requires 5 hours Crav Y-MP time 

Dynamic Extended Range Forecasting (DERF) 


- requires up to 32 MB memory on one processor 



- requires 120 MB SSD (Cray Solid State Disk) 

• Ocean Modeling 


- generates 1 GB data 

Tropical Atlantic Variability 



Coupled Climate Modeling 


• Global Data Assimilation System 

Use of New Sensors 





Based on NMC GDAS 

• Dynamical Systems 


Each seasonal reanalysis 

Predictabilit Theory 


- requires 140 hours Cray Y-MP time 



- requires up to 64 MB memory on one processor 

• Information Systems 


- generates 4 GB data 

Visualization, Data Manipulation and Management 





===> SUBSTANTIAL SUPERCOMPUTER TIME REQUIRED 



===> DATA MANAGEMENT £ VISUALIZATION CRITICAL 

: J 
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A 

Complaints About Existing Scientific Visualization Tools 

(adapted from results of a survey conducted by Mike Botts) 

* Tool is no! extensible or flexible 


* Tool is loo difficult lo lea r n and use 


* Too difficult to get existing data into the tool 


* Tool does not adequately link processes of visualization 
and analysis 


* Pioblems with heterogeneous environments 


* Tool designed with computer graphics, rather than 
science, in mind 


* Scientists not aware that the tool exists 


* Scientists lack appreciation that the visualization tool is 
a serious scientific tool 


^ * Too difficult to communicate the results ^ 

r 


GrADS i DESIGN GQAL£ 


INTEGRATE: 

ACCESS — MANIPULATION 

47) Gridded Data Expressions 

Station Data Functions 

General Slices 


- DISPLAY 

Maps 

Charts 

Animation 


INTERACTIVE: 


Sub-Second Response 
Data, Display Control 
Scripting, Programmability 


Design 


Command i 
UMerpreterJ 



EASE OF USE: 

Easy to Learn 
Intuitive 


Po9« 7 


COLA/UMCP 


HARDCOPY: 
Vector Graphics 
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Interpreted Command Line Scripting Language 




Language Design: 



Ch'ADS - IMPLEMENTATION 



• Programmability: as simple as possible 






* Form GrADS commands via string manipuhlio.is and pass 






back 10 program for execution 



PORTABILITY': 



• Return command results as script variables 



ANSII Standard C Code 






Isolated Device-Specific Graphics 
Standard Unix "Stream" Datasets 



Language Elements: 






• Variables of type "character" 



HIGH SPEED GRAPHICS: 



* Arithmetic and logical operators 



Built-In Routines 



• Built-in and user-specified functions 



Optimization 



• Flow control: loops, if/then/else 






* Fully recursive 



EXPANDABLE FUNCTIONALITY: 



Sami pie Usage: 



Easy to Add Graphical Displays 
Easy to Add Data Manipulation Functions 



• Automate commonly used command sequences 





• Perform complex calculations 






• Create new GrADS data files from results of GrADS 






calculations 






‘ Interact with the graphics screen 
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GrADS - REQUIREMENTS 



GrADS DATA SETS 



HARDWARE: 


, 

GrADS INTERNAL FORMAT 
- Binary 



Moderately fast Processor 



- Optimized for I/O Performance 



(386/7, any RISC) 



CREATE 



Moderate Resolution Display 



Fortran OR C 



(VGA or better) 



Standard I/O Statement 



Dynamic Memory Usage (0.5 MR Minimum) 


• 

MODIFY 

Fortran or C or UNIX file commands 



3 MB disk for GRADS executables and map files 



Update in place 






Extend 



Enough Disk Space for User Data 






(Local or on Network Server) 


• 

USE IN OTHER APPLICATIONS 
Fortran or c 



Postscript Printer - Color or Monochrome 






(other devices can be easily supported) 


m 

PORTABILITY 

All UNIX Computers (E. G. NFS) 



SOFTWARE: 



DOS-based personal computers 



ANSII Standard C Compiler + Libraries 


• 

OTHER FORMATS CAN BE SUPPORTED 
(packed binary, ASCII, net CDF, etc.) 



X Windows (UNIX) 





v_ 
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► Model output analysis 

• Globa! atmospheric general circulation models 

• Global ocean models 

■ Tropical models 

■ Coupled ocean-atmosphere models 

■ Observational data analysis 

Station data (African rainfall, Asian soil moisture, etc.) 
Gridded objective analyses 


Interactive classroom use 
Student research projects 
Student self-education 


Forec ast ing: 


1 Real lime observational data analysis 


Public information: 


Daily weather forecasts 
Maryland state ozone maps 
Seminar with interactive displays 


GrADS - 


GROUPS 


(in chronological order of contact) 

l,S. IMVKRSr TIES AND FEDERAL LABORATORIES 

University of Maryland at College Park 
Columbia University (Lamoni -Doherty Geological Observatory j 
Colorado Stale University 
California Institute of Technology (NASA JPL) 

National Center for Atmo phere Research 
UNIDATA (University Corporation for Atmospheric Research ) 

NASA Goddard Space Flight Center (Codes *>10.3. 910.4. 91. G 
NOAA/National Meteorological Center (Dev. Div. and CAO 
NOAA/ERL Geophysical Fluid Dynamics Laboratory 
NOAA/ERL Air Resources Laboratoiy 
NOAA/ERL Forecast Systems Laboratory 
U.S, Geological Survey (USGS - Reston, VAj 


U.S. COMMERCIAL INSTITUTIONS 


Atmospheric and Environmental Research. Inc. 


FOREIGN UNIVERSITIES AND GOVT. LABORATORIES 


INPE/CPTEC (Space Studies institute. Brazil) 
CNR/TMGA (Geophysics Institute, Italy) 
ICTP/ICS (Climate Institute, Italy) 
ENEA (Energy Agency. Italy) 

National Tidal Facility (Australia) 
KIST/SERI (Korea) 

Dalhousie University (Canada) 
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The HLU visualization model The HLU visualization model 



After master plot is drawn, overlays are drawn on 
top of the master plot in the order they were 
registered 





Using NCAA Graphic* in a Data Flow Environment 

I. Introduction 

II. Issues addressed: 


c. Future «nh*nc«Mots to NCAA Graphics for 

d. Hints on importing NCAA Graphic* into « 

Definition of a data flow rnvi rooment ? 

ZV. Modal a Types 


;iow environment 


*?' "JJ** 1 to *ub«ampl« , mat sage and process your data 

a !Jf P : to ircnv 'tn. data into a visual representation 

d. Display - used to render your image or output. 

V. Popular Examples of Data Flow Environments; 

*• JJJ nlf!! l f« 0n , pr0llucti00 Environment) originally developed at 
the Ohio Super Computer Center, Now TaraVisuel Corp 
“■ Khoroa - Univcraity of New Mexico 

d. Explorer - Silicon Graphics 

e. AV5 {Application Visualiiation System) - Advanced visual Systems. r ac . 

VI. Benefits of Using NCAA Graphics ia a Data Flow Environment 

*- of use for "Non- programmers* 

b. Widget Customization 

c. Distributed Processing, 
e. Image Processing Tools 
t. Public Dana in Modules 

i. International A VS Center ravs.ncse.org) 

N. Carolina Super Computing Center 

ii. UX and US site far Explorer Kxlules 
svedishchef . lerc . nasi . gov 

g. NCAR Graphics adds more power and flexibility to Data Flow Environments 
VTI. Discussion of reasons for not using NCAR Graphics with a Data Flow £nv. 


NCAA Interactive 

NCAR Graphics functionality already 
Cost 3 

NCAR Graphics does not currently support g* 


b. NCAR Graohics functional!*, < n Data Flow Environments 

d NCAR Graohics do*, nor — letric data output. 

VIII. Importing NCAR Graphics into a Data Flow Environment Module 

a. Module Components 1 

i. Parameter input specification (widgets ) 

11. Data input ipecificatioo (input port type) 
lii. Data output specification (output port types 
iv. Computation routine (NCAR Graphics program) 

b. Module Building Templates 

IX. Displaying NCAR Graphics output in a Data riow Environment 

X. S urinary 
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A Distributed System for the 
Visualization and Analysis of Observed 
and Modeled Meteorological Data 



Lnclosed is j copv ot our presentation nuicri.il for the Advanced Information Systems 
Research Program Work shop held in Boulder. Colorado on Wednesday. August 12. W2 
titled "A Distnhuied Analysis System lor Model and Obsrrv ational Data 1 





* Maximize the use of existing software 

- GEMPAK, meteorological analysis and display 
software developed at NASA / GSFC 


GEMVIS: A Distributed System for the Visualization and 
Analysis of Observed and Modeled Meteorological Data 

A joint effort between NASA/GSFC and the National 
Center for Supercomputer Applications 
funded by OSSA/Appfied Information Systems Program 


Various 3D interactive capabilities developed at 
NCSA 


Commercially available visualization systems 
(AVS and Iris Explorer) 


* Use commercially available visualization and 
application builder tools (IRIS Explorer and AVS) 

* Provide visualization and analysis capabilities in 
these areas: 

- 3D volumes of data 

(isosurfaces, 3D cross sections, vector 
displays, etc.) 

- Evolution of data over time 

(time sequences, trajectories, animation, etc.) 


GEMPAK analysis 
and display 
software 


- Distributed processing over workstations and 

supercomputers, including interactive control 
of simulations 

* Provide a highly interactive environment on a single 
display 


New visualization and distributed 
processing capabilities 
developed at NCSA 


Present 2D GEMPAK Functions 


TIME SERIES DISPLAY 

\ 

X7 OR T/Z CROSS SECTIONS 


HODOGRAPHS 

/ 

VERTICAL PROFILES 


2D DISPLAY 
FUNCTIONS , 


— THERMODYNAMIC DIAGRAMS 





At ser D 




SCALAfWECTOR > 
DIAGNOSTICS > 


^ COORDINATE ^ 
TRANSFORMATIONS 
JMAP PROJECTIONS), 
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EXTRACT 
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Two-dimensional depictions of a tropopause fold (potential vorticity) 
using GEMPAK vertical cfoss section and horizontal contour mapping functions 
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Create a flexible, modular, and distributed 
environment for data handling, model simulations, 
data analysis, and presentation to be used in 
studying atmospheric flows on all scales. 


Solution: 

Likely environments: 

- AVS 

- Khoros 

- IBM's Data Explorer 

- SGI’s Explorer 


Requirements: 

A software technology framework which 
incorporates the following as fundamental design 
characteristics: 

- Change (extensibility, user requirements, 

functionality, hardware, etc.) 

- Integration with Other Solutions 

(hardware/software systems, output 
media, etc.) 

- Tiered Access for Different Users 

- Tools for Constructin User Interface 


Distributed Computing 
Portability 


None is proven. Of these, AVS is the most widely 
used and NCSA has been working with AVS for 2 
years. The others are promising - need 
investigation. 

We choose SGI Explorer as our primary tool for these 
reasons: 

- 2nd Generation Design 

- Natural distributed processing with natural 

concurrent execution framework 

- Application User Interface designer 

- Easily Extensible 

- Impressive Development Team 

- NCSA in a position of influence as to the direction 

of Explorer's development 

GSFC and NCSA spent time up front to prove feasibility 
of translation between Explorer and AVS modules, to 
safe-guard the decision to use Explorer over AVS. 


- 3D Interactive Performance 
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Accomplishments (GSFC) 



r 
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Accomplishments (NCSA) 


V. 

a r 


* Tested alpha and beta releases of Explorer 

* Prototype 3D interactive display of multiple level 
horizontal 2D contours over a map 

* Interactive arbitrary vertical cross-section using 
map locator selection 

* Prototype image flip-book animation 

* Particle trajectory calculation and display 

* Time handling within Explorer, including time-based 
interpolation and looping constructs 

* HDF data file interface to Explorer 

* Real time control of simulation running on Cray from 
within Explorer on workstation 

* Demonstration at SIGGRAPH '92 


J V 
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Prototype GEMPAK adaptation to AVS ? including 
port of GEMPAK to Convex C220 

Evaluated and selected visualization and 
application builder software 

Adaptation of GEMPAK to Explorer 

- GEMPAK grid file interface to Explorer 

- Grid scalar and vector diagnostics 

- Coordinate transformations / map projections 

- Topography and map displays 

Modification of GEMPAK to generate 3D vector 
data 

Display of 3D vector fields 


J 

Problems Encountered 

GEMPAK was conceived as an integral system. 

It’s hard to separate subsystems due to unexpected 
and hard to track connections. These problems 
have been resolved. 

Explorer Version 1.0 had problems, for example: 

- Bugs in: 

module grouping 

synchronization of data from various 
sources 

- Lacks functionality in: 

database management 
animation (scripting, looping, etc.) 
user interface management 
annotation 

Explorer Version 2.0 has addressed all of these 
problems except for database management, which 
we are addressing in our development. 

Explorer has no support for "missing" data We 
will develop our own tools for handing this. 

"Visual Programming Environment” sounds good, 
but can be confusing to the user. We are using the 
Explorer module grouping tool to manage the 
complexity of the visual programs. 
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What We've Gained Over 2D GEMPAK 

* 3-Dimensionaf Views 

* High Level of Interactivity 

4 Distributed Processing 

* Easy Extensibility 


What We’ve Lost From 2D GEMPAK 

* Annotation and Physical Coordinate Display 

s Standard Meteorological Symbols 

* Ability to Run on Low-Performance 
Machines 


A 

Future Work -- Near Term 


We will release software for Use in Severe Storms 
Branch of NASA / GSFC in October, 1992. 

* Minor User Interface Improvements and Fixes 

- creation of a set of "canned" processing 

maps 

- module grouping to simplify the interface 

- minor changes to improve consistency 

among module interfaces 

* Preliminary Annotation 

- axis labels 

- titles 

- limited control of size, placement, etc. 

* Image Loop Animation 

- the ability to store a series of rendered 

images either in memory or on disk 

- the ability to "play back" the sequence of 

images with some control over speed 
and order of images 

* User Documentation 


V. 


J K. 


1 

Future Work - Long Term 


Future Work - LonaTerm tcont.) 



* Annotation 

* Database for Meta- Data 


- full control of size, color, placement, etc. 

- units of data 


- axis labels 

- time tag 


- color legends 

- physical coodinates 


- time 

- etc. 


- titles 



- etc. 

* User Interface Enhancements 



For example, self-configuring control panels 


* Investigate Performance Enhancements 



For example: 

* Enable the User to Query Quantitative Data 


- geometry caching (storing geometries for 

from the 3D Scene 


each time step, so that they can be 



sequenced rapidly) 

* Additional Visualization Techniques 


- memory management to reduce paging 

For example: 


(data compression might be used to 

- better use of transparency 


reduce memory usage) 

- different vector representation 



- increased use of color 


* Port Capabilities into AVS 

* Animation 


* Incorporation of Imagery 

- scripting 



- time sequences of data 


* Improved Handling of Missing Data 

- animation of parameter changes such as 



isosurface value, cross-section 


* On-going investigation of extensions in areas 

position, or view point 


such as new visual representations, virtual 



reality, and video 

L J 


l 4 Beta Release Through COSMIC & Unidata j 
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TECHNIQUE SUCCESSFULLY USED ON SEVERAL HUNDREO VOYAGER IMAGES 
TO REMOVE A + 0 . 85 ° ERROR IN ROLL ANGLE! 
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SAIC/SAVS 
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Link Winds: An Approach to Visual Data Analysis 
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INTRODUCTION 

Modem space, sensor and computer tcchnolog ics have made it possible lo understand the Earth and its 
environs as never before Population pressures and modem technology have also tended to make it 
imperative that we do so To accomplish this, tremendous masses of dan must be gathered, transported, 
stored and understood To address some of these topics, a program of research is being conducted into 
the application of computer graphics to the problems of quickly exploring and analyzing very large 
amounts of scientific or engineering dam The objectives of the program arc (1) to develop a software 
environment which will support the rapid prototyping of visual data analysis applications, while at the 
same time maintaining the high level of performance necessary for inicractivcty manipulating graphical 
displays. (2) to develop a user interface that is truly intuitive and easy lo learn and allows quick access to 
the software Tor the novice as well as the advanced user. (3) lo provide a suae of sample applications 
which arc useful across a variety of scientific disciplines, and (4) to provide tools to support user 
development of applications for this environment 

LINK WINDS 

The Linked Windows Interactive Data System, or Link Winds, version 13 is a prototype product of this 
research erfon In compliance with our research objectives, it is a visual data analysis/exploration system 
designed 10 rapidly and interactively investigate large multivariate and/or multidisciplinary data sets to 
detect trends, correlations and anomalies LinkWinds is aft integrated multi-application execution 
environment with a full graphical user interface (GUI) The system, operating under Unix, is based on an 
object-oriented programming model and is implemented in the C-languagc For its graphical user 
interface and graphics support software, it draws upon the Silicon Graphics inc (SGI) GL-library. and 
presently runs only on workstations supporting this library This includes all SGI workstations, and those 
of other manufacturers who have licensed and support the GL-library . 

Individual tools and data sets arc coded is objects, each occupying a window on the LinkWinds screen, 
and communicating with other objects through a message passing protocol The objects or windows, 
containing data displays and controls for manipulating these displays, can be linked or unlinked at the 
discretion of the user The act of linking the windows together sets up one-way message paths This 
data- 1 inking paradigm makes the system perform much like a graphics spreadsheet and as in a 
spreadsheet is a powerful way of organizing the data for analysis while at the same time providing a 
natural and intuitive interface Dam-linking, and its user interface implications arc discussed below 

Messages generated by LinkWinds objects are recorded as program statements in an underlying language 
called Lynx, which is based upon Scheme, a dialect of Lisp. A modified Scheme interpreter is included 
in the system The message passing characteristics are the basis for two key LinkWinds functions The 
first or these is the maintenance of an internal journal or all user originated commands executed by the 
environment This file can be saved ai aiwume through a menu option The record can then be replayed 
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which lists all of the databases of interest to the user Those listed appear in the top level ’Databases 
menu The second file is a data description containing the filenames of all data sets to be associated 
during the analysis, the number of axes and their names, any metadata needed to translate axis values to 
numbers meaningful to the data, etc Data sets listed in these files appear m the top-level Data menu 
A palette may be included in the KDF file if the user desires, or it may be defined in the data description 
file. Colors can also be assigned during a data analysis session using a color management tool provided 
A much greater range of data formats will be acceptable to LinkWinds in future versions. 

APPLICATIONS OVERVIEW 

A suite or applications useful across many disciplines has been developed for the LinkWinds 
environment Figure 1 shows a typical session to explore a data scl collected by the Microwave Limb 
Sounder (MLS) currently in orbit aboard the Upper Atmospheric Research Satellite (UAJtS) The 
LinkWinds top-level menu is shown on the left and data objects, with their single link buttons, arc in the 
upper left-hand comer In this case, the data displayed are ozone and water vapor The window enutled 
Image! comaias a xhcc of the data at an altitude of 21 M mbars. as selccicd by Slidcrl which is linked to 
it lmacc2 is also linked to Slidcrl and shows the water vapor ai ihe same alutude The southern 
hemisphere ozone hole is shown at the lower left oflmagc! , and a high value of ihe waier vapor is shown 
tn a corresponding location in Image! The anti correlation is shown in Scattcrl. where ihe points shown 
come from the bounding box shown in Image I. controlled by linking Image 1 lo Scattcrl The ozone data 
are displaved in Globe! as a height field rendered on a sphere The ozone hole associated with the south 
pole is clearly seen. Slidcrl also controls the depth of this display, and the height scale is controlled by a 
vertical slider along the right side of the window Pan/Zoom and 2-Axis Rotator controls arc also linked 
io Globe 1 through the Animator The Animator makes it possible to select a starting scl of control 
settings,- and an ending set Then it will automatically record and save the number of frames selected 
with its sltder, resulting in an animation of the sequence of sellings from sun to end This sequence can 
be replayed later 

Figure 2 shows coregistered data collected in the Dcadman Butte area of Wyoming Line Plot# 
applications show spectral profiles for each of the data Sets, and is controlled by a cross-hair on Image* 
dynamically associating a spectrum with each point on the terrain Sliders on Line Plot* allow the 
selection of three or the channels to be colored red. green or blue, and RGBFilter* permits interactive 
color stretching of each of these channels. These also control the colors used for the perspective 
renderings Of the data in Plane# Thematic Mapper (TM). Quad-pole Synthetic Aperture Radar (SAR) 
and Thermal Infrared Muluspcctral Scanner (TIMS) data are superposed on an elevation map of the 
terrain in Plane 1. Plane2 and Planc3. respectively These in turn are oriented in unison by the Pan/Zoom 
and 2-Axis Rotator controls 
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ai the initiation of subsequent LinkWinds sessions, allowing the user io draw upon a previous layout of 
LinkWinds applications and links, or repeal a full analysis session 




The second function bawd upon the Lynx message passing protocol is the multi-user science 
environment (MFSE) which provides a method for multiple LinkWinds systems to communicate via 
networks. Using menu options, users remoicly separated can connect to one another, and by also 

establishing a telephone voice conncciinn. can cooperatively view and manipulate their data A 

successful connection requires that each user be executing LinkWinds and dial each has access to the 
data sets that are being analyzed This is normally arranged by transporting the data sets to each user 

poor to the collaborative sessions The MFSE. capability is also used to give tutorials over the network 

to new' users unfamiliar with LinkWinds, and to allow users to demonstrate recommendations for 
application changes or io point out bugs 


Hard copy of the LinkWinds displays arc provided by function keys on the keyboard Placing the cursor 
in a window, and pressing FI produces an image of a window's contents; pressing F2 saves the complete 
window, and F3 saves the lull screen The figures shown were obtained in this manner 

DATA -LINKING AND THF USER INTERFACE 


In addition to the normal GUI functions provided by the windowing environment, dynamic manipulation 
of graphs and images is facilitated through the da to-linking paradigm Data -linking can he understood in 
the context of a spreadsheet, where cells containing numbers are linked to other cells Formulae arc 
associated with each cell, so dial when a number changes all cells linked to Uk changed cell recalculate 
their values and update LinkWinds discs the same Hung, but m a graphics environment where the rigid 
gnd structure gives wav to free form, and a cell can translate, for instance, into a slider or large scale 
number arrays such as images, 


The user interface based upon the dau-linking paradigm is one of tlx most distinguishing features <»r 
LinkW’inds It evolved from a desire to create a truly easy to learn and intuitive user interface We are 
guided bv Ihe principle that users are impatient and want to gel started on productive work as quickly as 
possible Therefore, an interface was needed which can be teamed by exploration, and which conforms 
to user expectations as they work with it 


Dau-Iinking is affected through two icons The link icon is a button displaying two interlocking rings, 
while the unlink icon displays two nngs that are separated Each object on Hie screen has either a single 
link button, or the full set of link and unlink buttons Hie presence of a single link hutton indicates a data 
object, while the presence of the pair indicates applications with control functions to perform a link, the 
cursor ts placed on the appropriate button, and a "rubber band is dragged out and dropped into the 
application to be linked To break the link, the same thing is done using the unlink button There are two 
simple rules to follow in applying the linking paradigm 

t . When as a resuli of menu selections an empty window appears on the screen, put data in it This »> 
done by linking a data object into the window 

2. When an object with the pair of link symbols appears, exercise its control function by linking it 
with any application object 


DATABASE INTERFACE 




The current version of LinkWinds accepts data in the S-bit rosier Hierarchical Data Format (HDFi 
created and supported by the National Center for Supercompuung Applications (NCSA) at die University 
of Illinois. Champagne/Urbana A 2D data file is a single image, while a 3D data file is a sequence of 
images The data ingestion is controlled by two text files which arc generated by the user Sample 
versions of each of these files are provided and serve as self-explanatory template*. The first is a file 


s 


Figure 1 - Unwinds session to explore upper atmospheric ozone and water vapor measired 
by the Mierwave Limb Sounder aboard UARS. 






Figure 2 ■ Coregisfered special data gathered in Ihe region of Deadman's Butte in Wyoming. 
Several spectral bands are shown rendered in perspective 
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Executive Summary 


THE IMPORTANCE OF VISUALIZATION TO THE EOS MISSION 

• Visualization is a vital component of the tools required for meeting EOS scientific 
objectives. Neglecting visualization could result in failure to meet these objectives. 

• At present, NASA has given considerable attention to the dau retrieval, data management, 
and scientific objectives node? of the data pipeline, but has given minimal attention to the 
role or visualization within the EOS project. The justification for this approach has been 
primarily rooted in the belief that Commercial Off-The-Shelf (COTS) software, as well as 
modest development activities of NASA and other agencies, will be adequate to meet these 
needs. 

• This initial report highlights successes and general deficiencies with the current state of 
visualization development and application. It is based on the results of meetings with 50 
groups of EOS scientists and developers at 30 sites, as well as the author's experiences in 
trying to meet visualization needs at NASA MSFC. A follow-up report will examine 
possible options for assuring that there are adequate and proper visualization tools for 
meeting the scientific objectives of the EOS mission. 

APPLICATION OF VISUALIZATION WITHIN THE EOS MISSION 

• Applications of visualization within the EOS project include: 

- Scientific Investigation 

- Data Validation 

- Model and Algorithm Development 3rd Validation 

- Data Browse 

- Information Transfer 

- Mission Operations 

BACKGROUND ON VISUALIZATION 

• Visualization, although a relatively new discipline, actually consist of the unification of 
several fairly mature components, including: 

- Image Processing 

- 2D Data Plotting 

- 3D Computer Graphics and Animation 

- Volume Rendering 

- Geographic lnformailon Systems (GIS) 

- Computer-Human Interactions (CHI) 


Each component provides important capabilities to Earth systems scientists, but, at 
present, each also has specific deficiencies for meeting EOS needs. 

CHANGING ROLES AND CAPABILITIES 

• Due to rapidly increasing CPU and graphics power available to the scientist at his desktop, 
two major transitions arc occurring within the scientific computing environment, including 
transitions: 

— from centralized to distributed computing 

— from batch-mode operations to interactive computing 

Both of these transitions are putting more computing power and control directly in the 
hands of the scientist. It is this interactivity that will result in the greatest benefits to be 
derived from scientific visualization. However, without the proper software to take 
advantage of this power, these benefits will not be realized. 

• These transitions are demanding changes in the roles played by scientists and computer 
specialists, as well as requiring changes to our visualization and analysis tools. The batch 
mode of handing off visualization jobs to computer specialists is no longer adequate 

• The movie malting era or visualization was a necessary and important phase of these 
transitions, but docs not represent the total required direction Tor visualization 

• The most important immediate direction for visualization cfTous is that of putting useful 
and usable Interactive tools into the hands of the scientist. 


WHY AREN’T SCIENTISTS USING WHAT'S AVAILABLE? 

• Many of the techniques and components or visualization required ror meeting EOS 
scientific objectives arc available today. However, the actual use by the scientist of even 
our present visualization capabilities is well behind these capabilities. 

• Why aren't scientists using the visualization capabilities that are available to them today? 
Reasons include: 

— The too! is not extensible or is too inflexible. 

— The tool is too difficult to learn and use. 

— It is too difficult to get existing data into the tool. 

— The tool docs not adequately link visualization and analysis. 

— The collection of tools, as well as the data, exist within a complex heterogeneous 
computing environment. 

— The tool does not do what the scientist needs to do. 

— The scientist is not aware that the tool exist or that it meets his/her needs 

— The too! is too costly. 
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- The scientist does not have access to adequate hardware for running the tool. 

- It is too difficult to communicate the results of the finding to others, because (a) it is 
too difficult to get color prints or video, (b) it is difficult or impossible to interact 
remotely with colleagues, or (c) the publishing industry is too technically and 
philosophically archaic to meet the present needs for color hardcopy, animation videos, 
algorithm and application exchange, and voice and sound annotation. 

WHERE ARE THE VISUALIZATION BOTTLENECKS ? 

• With Tew exceptions, hardware capabilities 3re not, at present, a major limitation of our 
visualization environment. The commercial marketplace is probably adequate for assuring 
necessary advances in hardware technology. 

• The primary bottleneck is the lack of adequate software which allows the scientist to take 
advantage of this power and to interactively visualize and analyze his/her data within our 
complex computing environment. It is questionable whether Commercial Off-The-Shelf 
(COTS) software will be adequate for meeting all or the visualization needs of EOS. 

COTS, PUBLIC DOMAIN, AND IN-HOUSE DEVELOPMENT 

■ There are advantages and disadvantages to relying on either COTS or in-house developed 
software for meeting the visualization needs of EOS A proper balance between COTS, 
public domain, and in-house development is advantageous, but this balance must be 
accomplished with adequate and properly-direcied support. 

• If NASA is to rely more heavily on COTS software, then it must re-evaluate and improve 
the way that it deals with COTS developers. 

CURRENT DEVELOPMENT ENVIRONMENT WITHIN NASA 

• NASA funded development has resulted in several significant, leading edge visualization 
and analysis tools. 

« Unfortunately, with particular regard to visualization, the development environment within 
the OSSA can be characterized as: 


- Fragmented, with little overall direction or coordination 

- Lacking an established and understandable organizational structure for Ending both 
genera! tool and application -specific development 

- Lacking a firm commitment or plan Tor meeting the true visualization needs of the 
EOS science community 

- Lacking adequate mechanisms for technology transfer both within OSSA and between 
OSSA and OAST 

- Experiencing inefficient use of limited funding 


programming principles would minimize redundant programming while meeting 
application -specific needs. 
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Simplification of the Complex Heterogeneous Computing Environment - Scientists 
and computer specialists are forced to operate within a complex heterogeneous 
computing environment, consisting of incompatible operating systems, graphics 
protocols, networks, file formats, and output devices. ElTorts to homogenize this 
environment will help to a limited degree, but would probably be only temporary relief. 
There is a vital need for utilities which shield scientists from dealing with unnecessary 
complexities, allowing them to concentrate on analysis or data. 

Development or New Techniques and Components - The exploration of new 
techniques for analyzing and visualizing data should continue. In a few cases, very 
different approaches to analyzing and visualizing data may be introduced which 
require development of a complete application. However, in most eases, the 
introduction or new techniques should require only the development of a module or 
component which can be added to existing applications. 

Education and Communication • Many or the present challenges in visualization are 
the result of a lack of proper communication and education. Scientists and project 
managers must become better educated as to the scientific benefits or visualization and 
the availability of existing software, while in-house and COTS developers must become 
more aware oT the true needs and objectives or the scientist. 

Distribution and Maintenance - The distribution, maintenance, and support of COTS, 
public domain, and in-house software is a significant challenge. Proper mechanisms 
for licensing and supporting COTS software within our distributed environment should 
be in place to assure their availability and testing. A major challenge with in-house 
development is the distribution and maintenance of successful application 
development, and the transition of such programs from the experimental stage through 
maturing and operational stages. 


CONCLUSIONS 

• Visualization is vital for meeting the scientific objectives of the EOS mission. 

• Although our present suite or visualization techniques Is impressive and powerful, the 
application software for pulling these capabilities into the hands oT the scientist is, at 
present, inadequate. 

• The development of extensible, uscr-rricndly, object-oriented software within the 
commercial software industry, is helping to increase the probability that COTS software 
can serve as a core for meeting many visualization needs. However, there will still be a 
need for in-house efforts directed at extending and modifying these tools to meet 


• Required in-house development is of two types: (I) development of general tools to meet 
the needs of a wide range of users, not being met by existing software, and (2) extension, 
modification, and integration of existing tools to meet application-specific requirements. 


DIAMONDS AND DINOSAURS 

• A difficult challenge Tor NASA will be that of recognizing and supporting in-house 
'diamonds" (experimental development efforts which hold much promise for scientists) 
and of preventing, recognizing, and dealing with 'dinosaurs’ (old development programs 
with limited momentum and decreasing application). Present mechanisms for transferring 
successful development activities into usable technology is inadequate. 


MAIN AREAS FOR CONCENTRATION 

• The main general areas presently needing consideration include: 

— Integration of Visualization with Data Management and Analysis • Data 
management, analysis, and visualization represent a triad of functionality required to 
meet scientific objectives of the EOS mission. To meet these objectives, this triad must 
be properly balanced and adequate bidirectional finks must exist between each 
component. Links from visualization to analysis, and from visualization to data 
management, are essentially nonexistent, at present. Navigation, or knowing the 
spatial and temporal location of each data point, spans all components of this triad and 
is a particularly critical subset of this issue requiring immediate attention. 

— Application Development for the Scientist as the End-User • The most important 
immediate need for visualization is that of getting interactive tools into the hands of 
the scientist. This requires that these tools meet the actual needs of the scientist, be 
simple and intuitive to use, and be logical to the scientist rather than to a computer 
specialist. 

- Providing Extensibility without Complexity - Extensibility of visualization tools is a 
vital requirement, particularly for COTS software. This extensibility must be provided 
without a significant increase in complexity of use. or any resulting complexity must 
be able to be hidden from the scientist by customizable interfaces. 

- Application-Specific Programming without Redundant Programming - Redundant 
programming within the Earth systems science community is extensive. Often new 
development efforts arc undertaken in order to provide functionality not available from 
existing software. However, in order to provide new application-specific functionality, 
many development activities spend a very large portion of its efrons on redundantly 
programming functionality that already existed in several other programs. The 
availability and application of extensible software and the adoption of object-oriented 
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application -specific requirements, and Tor developing leading edge techniques and 
applications not available ftom the commercial front. 

A follow-up report will investigate viable options for meeting the visualization 
requirements of the EOS mission in the future. These options will focus on two major 
objectives: increasing the crfcctivc use of visualization tools by scientists, and maximizing 
the return on development efforts. 
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THE IMPORTANCE OF VISUALIZATION TO THE EOS MISSION 

The proper storage, management, and distribution of the enormous database to be generated by 
the NASA Earth Observing System (EOS) Mission is extremely important to the scientist 
requiring access to this data However, without the proper tools to also analyze these data sets 
this enormous database is suicily that, a database The successful accomplishment of ihe 
scientific objectives of EOS is highly dependent on the scientists* ability to analyze and 
interact with these data in order to retrieve the important knowledge hidden within this 
database. The only way to effectively retrieve significant information from such an enormous 
database is through a well-balanced use of data management, analysis, and visualization 

As illustrated in the simplistic cartoon of Figure I . the typical data now of EOS-rcIatcd data 
moves from the satellite and ground-based data sources, into the data storage and data 
management nodes, and to the analysis/visuallzation nodes where the scientist gains scientific 
insight through his perception of the analytical and visual results. Results must then be 
communicated to colleagues or policy makers through speech or visual means. A bottleneck at 
any point along this path How will result in less than successful completion or the scientific 
objectives of EOS. At present, NASA has given considerable attention to the data retrieval, 
data management, and scientific objectives of the data pipeline, but has given minimal 
attention to the important role of visualization. The justification for this approach has been 
primarily rooted in the belief that Commercial Off-The-Shelf (COTS) software, as well as 
modest development activities or NASA and other agencies, will be adequate to meet these 
needs. 

The purpose or this study is to investigate the needs and application of visualization tools 
relative to the EOS objectives, to become familiar with mature and on-going visualization 
development efforts or industry. NASA, and other groups, and io evaluate whether ihese efforts 
will be adequate for meeting the scientific objectives 0 rEOS. The results arc based on the 
results of meetings with 50 groups or EOS scientists and developers at 30 sites, as well as the 
author’s experiences in meeting visualization needs at the NASA Marshall Space Flight Center 
(MSFC). This initial report highlights successes and deficiencies with the current state of 
visualization development and application, and recognizes areas where attention should be 
focused. A follow-up report will examine possible options for assuring that we have adequate 
and proper visualization tools for meeting the scientific objectives of the EOS mission. 


APPLICATION OF VISUALIZATION WITHIN THE EOS MISSION 
There are five major applications of visualization within the EOS mission: 

• Scientific Investigation 

• Data Validation 

• Model/Algoritbm Development and Validation 

• Database Browse 

• Information Transfer 

• Mission Operations 
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Visualization Is proving invaluable in providing new insight into Earth systems science 
problems. Whereas scientists were previously limited to painstakingly analyzing a small 
subset, or profile, of the an available data set. visualization is i;ow providing scientists with 
the ability to observe and analyze complete data sets within 4D spatial and temporal domains 
Even more significant is the ability to view, analyze, and correlate several data sets, related in 
space and time, in order to belter discern the relationships and interactions of various physical 
factors. The interactivity of these tools allows the scientist to more effectively probe the data 
and the analysis process, and to receive immediate response to 'what-if questions. 

Even simple animations have been shown to be very useful in pinpointing errors within data 
and computer models, and in many eases, in tracing the source of these errors This is 
particularly important when locating the source of propagating errors within model runs. Such 
errors may not be spotted, otherwise, unless they result in fatal errors, and even then it 
becomes quite difficult to trace back to the source of the error without Ihe assistance or 
visualization. As visualization techniques become better combined with analytical methods, 
data and model validation, using visualization, will become much more effective and efficient. 

The abundance and size or data sets available to an EOS investigator, will necessitate the use 
of data browse capabilities, The efficient use of network traffic, computer resources, and the 
scientist’s time will surfer significantly without the scientist's ability to quickly transfer and 
visualize greatly compressed data subsets, in order to predetermine the scientific relevance of 
the full data set. 

Finally, visualization offers one of the most effective and intuitive means or communicating 
large amounts of scientific information to colleagues, program managers, policy makers, and 
students. This importance of visualization for education and communication should not 
overlooked. Furthermore, as remote interaction of science via graphics workstations, becomes 
more a reality, the use of visualization for information exchange between scientists will 
become routine and very effective. 


BACKGROUND ON VISUALIZATION 

Although the field of scientific visualization is still in its infancy, it actually involves the 
unification of several traditionally independent disciplines, some of which arc fairly well 
developed. These include: 

• 2D Data Plotting and Contouring 

• Image Processing 

• 3D Computer Graphics, CAD, and Computer Animation 

• Volume Rendering 

• Geographical Information Systems (CIS) 

• Computer-Human Interaction (CHI) 

Each of these disciplines has provided, and will continue to, provide significant tools Tor 
scientific investigations Each also has specific areas of deficiencies which will be discussed 
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in more detail in a laicr report. For example, many of the 2D and 3D plotting applications still 
retain the mentality of pen-plotter output, which docs not allow interactivity between the user 
and the plot displayed on a screen. Many visualization application programs have completely 
ignored the power and simplicity of plots and contour maps for displaying important 
information, much to the dismay of the science community. 

Although digital image processing is a very important component of visualization, many 
techniques for processing imagery arc rather archaic, particularly with regard to fully utilizing 
data available in more than three spectral bands. Also, many or the major application 
programs for image processing have also become somewhat archaic with regard to their user 
interfaces, program architectures, graphics capabilities, data structures, interactivity, and 
portability to RISC-based workstations. 

Advances in 3D computer graphics, CAD. and computer animation, made during the 1980's, 
provide a substantial base Tor the development of important tools for visualizing 3D. temporal, 
scientific data. However, having been developed primarily for the purposes of military 
simulation and broadcast animation, many of the algorithms and application programs don't 
meet the needs of the visualization community today. Similarly, while volume rendering holds 
much promise for detailed visualization or volumetric scientific data, most of the present 
algorithms were developed for medical applications and are not well suited for interactivity, or 
for dealing with temporal or non -Cartesian data. 

Likewise, GIS developed primarily within cartographic -based, rather than scientific, 
disciplines. As a result, several present deficiencies in traditional GIS techniques have been 
recognized which limit its effectiveness in scientific applications. These include the lack of 
effective means Tor dealing with temporal elements, the inability to recognize and measure 
potential errors resulting from different spatial and temporal resolutions between data sets, and 
the inadequacies for handling of "fuzzy”, or continuous, data boundaries. 


r lt is not only important that we recognize the deficiencies that exist in our present tools, but 
that we also understand the current and future environment under which improvements to our 
tools must be made. These arc examined in the following section. 


CHANGING ROLES AND CAPABILITIES 

The CPU and graphics power available to the scientist is increasing at a logarithmic rate or an 
order of magnitude every 3-5 years. At the same lime, this power is becoming available on 
low-priced workstations. This reality is moving visualization computing away from a highly 
centralized environment, in which expensive computer equipment was centrally located and 
only highly specialized personnel operated this equipment, to a more distributed environment, 
where the scientist has significant computing power at his desk, while still having access to 
greater computing power and large amounts of data over the network. The scientist is thus 
becoming more in control of his computing environment, as well as taking on greater 
responsibility Tor the operation of his computing tools. 

In addition, the increase in computing power and the decentralization of the computing 
environment arc allowing much greater interactivity than was previously possible. Assuming 
that the proper visualization and analysis tools arc in place, it is this interactivity between the 
scientist and his/her data that will result in significant advances in the Earth systems science 
community. The batch mode of computer operation is no longer adequate for meeting the 
demands of scientific analysis. 

We arc presently in a slate of transition from centralized to distributed computing, and from 
batch to interactive processing. These transitions arc demanding changes in the roles played 
by the scientists and computer specialist, as well as requiring changes to our visualization and 
analysis tools. 
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Finally, the field of CHI is providing important insight imo effective interaction between man 
and computer. The design and application of proper graphical user interfaces (GUIs) and 
intuitive program structure arc crucial to putting effective visualization tools into hands of the 
scientist. In addition, a better understanding of factors which affect human perception is 
certainly important to the design of visualization tools, but is an area that is often overlooked 
during their development. 


Whereas the '80s was the decade Tor rapid advancement of computer graphics related to 
broadcast animation, military simulation, and industrial CAD. the '90s is promising to be the 
decade for scientific visualization. Many of the present efforts have simply involved the 
application of individual components, such as computer graphics and animation, to scientific 
data. Certainly, the techniques developed within each of these components represent a 
substantial base Tor visualization capabilities. However, the success of visualization as a 
scientific tool will depend on the proper adaptation of these components to meeting the needs 
of the science community, as well as seamless integration of all of these components imo 
application programs which arc useful and usable to the scientist. 



As an example of these changes, consider the roles and environments existing in typical 
visualization activities or the past and present. Previously, it was adequate for a scientist to 
request, from an image processing Specialist, that he or she apply a principle component 
analysis to a given Landsat scene, stretch the resulting bands over 8 bits, create an RGB 
composite of principle components 2, 3, and 4, and provide a color print of the resulting 
image. The required batch operation was easily defined by the scientist, who could in the end 
study the final image for weeks or months, creating hand-drawn overlays to illustrate scientific 
findings. 

In contrast, the scientist of today might wish to study and correlate several data sets which are 
related in 3D space and time, including, for example, satellite imagery, data measured along a 
flight path. 3D volumes from radar, and 3D gridded data from a numerical model. In order to 
gain significant insight from this study, the scientist will probably require, at a minimum, the 
ability to render representations of these data in 3D space, interactively move the point of view 
around the 3D space, view and control the animation of the data through time, control the 
representation or the data (c.g. color maps, transparency, etc,), evaluate the data with an 
appropriately-placed 3D probe or 2D slice plane, and generate relevant plots ’on-thc-fly’. 
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There is no longer a definable product that can be demanded of the computer specialist by the 
scientist, other than that or providing the scientist with proper tools, usable by the scientist, of 
properly integrating the scientist’s data into the tool, and of providing operational training and 
assistance when required. 

During this transition period, it is expected that we should see some remnants of our previous 
roles and computing environments. In fact, the proliferation of sophisticated movies showing 
scientific visualization is somewhat a symptom of this. While these movies have undeniably 
provided benefits in the form of scientific insight, public relations, and awareness of the power 
of visualization, they are not the future direction for visualization that is required to meet the 
total needs of the scientist. Far from being interactive nor analytical, these movies have often 
been created at great cost in time and funding, typically requiring weeks to months of effort 
from teams of scientists, programmers, computer visualization experts, computer animators 
and artists, and multimedia specialists. While this movie making is a natural extension of the 
80’s style of conqpu ter animation, as well as an extension of the previous roles of the computer 
specialists, many short-sighted scientists have, as a result, developed the view that 
visualization is strictly for creating ‘pretty pictures" and noi a tool for doing significant 
science. In others, these movies have instilled false security that the required visualization 
tools required are in place. In reality, scientific animation and movie making represent an 
important part of visualization, but they arc only a small part of the requirements for 
visualization. 

In essence, the most important immediate direction for visualization efforts is that of putting 
these tools in the hands of the scientist, allowing him/her to interactively probe the data 
without the constant aid or computer specialists. Computer visualization specialists should be 
involved in getting these tools into the scientist's hands and modifying and expanding these 
tools as required, and not in operating the tools Tor the scientist. Thus, the role of the 
computer specialist is moving away from the backend user of the tool, to the frontend gatherer, 
tester, and modifier of the tool. This, of course, requires that the application be well designed 
for the scientist as an interactive end user, with particular attention given to meeting the 
scientist's needs in 3 simple and intuitive manner 

In view of this, it is important to recognize the diversity that exists among scientist with 
regard to their application or computer tools. While not based on any statistical surveys, it 
appears that approximately 20% or the scientists arc comfortable with and actively involved in 
finding, learning, modifying, and applying the computer tools that arc available, and in many 
cases are willing to work with developers on improving these tools. At the other extreme, are 
approximately 20% of the scientists, who believe that the tools that they have employed in the 
past are adequate for meeting their needs, or who, because of lack of time or desire, only wish 
to be given a tool that requires very little time or attention to learn and operate. In lieu of 
such a tool, this group of scientists will rely heavily on computer specialists for all aspects of 
tool application. The remaining 60% of the scientists are reluctant or unable to concentrate a 
lot of attention on visualization and analysis tools, but will invest the required time to test and 
learn new tools if properly nurtured and if benefits become apparent. 




Scientists who are willing to invest lime and effort in the testing and modification of tools are 
a very vital asset to both the developers of these tools and to the scientific community, as a 
whole. These scientists should be encouraged and supported, not ostracized by other scientists 
and funding agencies as has occurred in the past. However, it Is also important for developers 
to be aware that these particular scientists represents a select group and not the majority of 
scientists who will not use a tool that is difficult to learn or use, or that docs not immediately 
meet their needs, 


WHY AREN'T SCIENTISTS USING WHAT'S AVAILABLE? 

With a Tew exceptions to be discussed in a later report, many of the techniques and 
components of visualization required for the EOS mission arc available today; these will 
probably progress and others will continue to be developed regardless of NASA's direct 
involvement. Certainly, NASA’s support of new techniques for visualization and 
analysis would greatly improve our future abilities and would direct development efforts 
along paths that are critical to NASA's objectives. 

In essence, although many of the components do exist to meet many of our present and 
Tuture visualization requirements, the actual use by the scientist of even our presen i 
visualization capabilities is well behind the development and availability of these 
capabilities. The major challenges and deficiencies with visualization, at present, lie in 
the integration of these components and techniques into applications that arc usable and 
useful to the scientist in his everyday work. 

A major question to ask. then, is 'WHY AREN'T SCIENTISTS USING THE 
VISUALIZATION CAPABILITIES THAT ARE AVAILABLE TO THEM TODAY?" 

The answers to this question begin to provide an understanding or the directions 
required for improving the state of visualization within the scope of the EOS mission. 
Although not every problem discussed below applies to every available application 
program, most every visualization tool or collection of tools required for a particular 
task suffers from one or more of the following deficiencies: 

• The tool is not extensible and is too inflexible to allow scientist to add 
discipline specific functionality or incorporate existing modules: thus, too 
many tools arc often required to meet the scientist's tasks. 

• The tool Is too difficult to learn or use. Th is is particularly troublesome if 
more than one tool is required to meet scientist's needs Difficulties of use can 
exist due to archaic command fine interfaces or as a result of program structures 
and graphical user interfaces that arc intuitive to computer experts, but not 
scientists. 
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• It is too difficult (o get existing data into the tool: this is generally a result of 
(a) incompatible data formats and the lack or available data niters, (b) the lack 
of true integration between the visualizat ion/an a lysis tool and the data 
management system, and (c) the exposure of ihc scientist to the complexities of 
networking. 

• The tool docs not adequately link visualization to analysis, which is critical 
to the scientific process. Going "back to the numbers' or more importantly, 

'back and forth between the visual and analytical’ is crucial before visualization 
can become a 'real' science tool in the scientist's mind. 

• The collection of tools, as well as data, that arc required for meeting a 
particular science need, exist within a complex heterogeneous computing 
environment. For example, the required tools might be running on different 
architectures with different operating systems, data might exist on different 
hosts as incompatible binary files, and the application might require different 
data formats. In addition, output or results may require completely different 
formats, complicated commands, and the need to transfer data to some other 
device on the network. 

• The tool does not do what the scientist needs to do, either because no tool can 
meet all needs or because the tool was designed with computer graphics, rather 
than science, in mind. 

■ The scientist Is not aware that the tool exist or lacks appreciation of the 
visualization tool as a serious tool that could greatly benefit his or her 
scientific research, or model and data validation, 

• The tool is too costly, particularly for desktop visualization. 

• It is too difficult to communicate the results of the findings to others. This 
results from at least three causes: (a) it is too difficult to get prints or video 
from the tool or platform, (b) it is impossible or difficult for the scientist to 
remotely interact and discuss with other scientist, the intermediate and final 
visual results derived from the tool, and (c) the publishing world is not well 
equipped, nor philosophically structured, to deal with the present needs for 
color hardcopy, animation videos, algorithm and application exchange, and 
voice and sound annotation. 

• The scientist does uot have access to hardware required to run the application 
software, or the scientist's hardware is too graphically underpowered to run the 
application effectively. 

Because or these difficulties, the application of present visualization tools have been 
inadequate and frustrating to scientists and computer specialists, alike. Many are not able to 
dedicate the time required to find and learn visualization tools which might meet their 


requirements, and have become frustrated and disenchanted with the present tools that are 
available to them. In order to "get the job done", many scientists often give up using present 
visualization tools in lieu of more ramiliar. but inadequate, analysis methods. The scientists 
who do persevere often reap significant benefits from the present visualization tools. However, 
a significant amount of their time and concentration, which could be applied to gaining insight 
into the d3ta. is often lost to learning, operating, and modifying the present tools. 


WHERE ARE THE VISUALIZATION BOTTLENECKS ? 

With a few exceptions, hardware capabilities are not. at present, a major limitation of our 
visualization environment The present hardware capabilities available in the market, greatly 
exceed the present availability of software tools with which the scientist can routinely take 
advantage of this power. Certainly, our increasing demands for very large data sets, our 
increased network traffic resulting from more distributed processing, the transfer of large files, 
increased multimedia activity, and the use of X window protocol, and our need Tor interactivity 
in visualization and analysis, arc resulting in a constant requirement for higher CPU and 
graphics power, and wider bandwidths throughout the data path. However, with the possible 
exception of network requirements, the commercial marketplace is probably adequately suited 
for assuring that the capabilities of available hardware will continue to increase at a rate 
commensurate with our demands and with our ability to take advantage of the available power 

With the capabilities of computers increasing by an order of magnitude every 3 to 5 years, it is 
important to consider the short time frame in which our capabilities increase and our present 
computers become obsolete. In addition, considering that the computer is possibly the most 
vital tool of today s scientist, it is important that funding and expedient procurement of these 
tools be available. The lack of accessibility to adequate hardware power, particularly at 
universities, and lack of expedient procurement of required hardware, particularly at 
government institutes, are at present limiting the effective use and development of 
visualization tools in the scientific community. 

Still, the primary bottleneck in visualization, at present, is the lack of adequate software which 
would allow the scientist to interactively visualize and analyze his/her data within a complex 
heterogeneous environment, without the need for the scientist to be concerned with the 
underlying complexities of the tools. Although there have been some significant advancements 
In visualization software, the available software as a whole is. at present, deficient in meeting 
these needs. Unlike hardware, it is questionable whether Commercial Off-The-Shelf (COTS) 
or public domain software will be adequate for effectively meeting the total visualization 
requirements for EOS. The following section examines this issue in more detail 


COTS, PUBLIC DOMAIN, AND IN-HOUSE DEVELOPMENT 

The question arises as to what extent the needs for scientific visualization software can be met 
by relying on the commercial software industry or other outside sources, in lieu of development 
funded by NASA. Each source has advantages and disadvantages. 


The advantages of relying on COTS include: 

■ NASA pays only for working product not development or experimental failures 

• Products tend to be more polished 

■ Maintenance and support not as major an issue for NASA 

• There is generally more awareness of a product’s existence 

The disadvantages of relying on COTS include: 

• Applications have traditionally been inflexible and not extensible 

• There is no access to source code required for code modification or porting of the 
application to necessary platforms 

■ Applications tend to be designed for large market appeal, not application specific 
needs 

• There is a higher cost factor to user, but not necessarily to NASA as a whole 

• Users arc less apt to "try it out' if the software must be purchased first 

• There is no direct control of directions of future development, and no assurance that 
developers will be around next year 

• Application is often developed outside of the scientists’ realm, generally without a true 
understanding of scientific needs 

• Approach for meeting needs is generally more conservative and less creative 

In contrast, the advantages of relying on in-house development are. 

• Software is designed to meet general and specific needs of NASA's scientific objectives 

• Source code is available to all for modification and portability 

• The product is generally flexible and adaptable, or can be cannibalized for new 
applications 

> New creative and leading edge products often result 

The disadvantages of relying on ir-house include: 

• NASA assumes the risk of development with no assurance of success 

■ Maintenance, distribution, and support are significant challenges 

• There is a risk of the software becoming a costly dinosaur that NASA is forced to 
maintain 

• Without proper direction and coordination, there is traditionally much redundancy or 
programming with in-house development 

There are many commercially available visualization packages which arc well ahead of most 
in-house developed software. Although some of this commercial software arose from initial 
development efforts at NASA and other governmental institutes, they have generally become 
more polished and portable. There are also many NASA-funded development efforts that arc 
unmatched in functionality by the software industry. Still, in many cases, COTS software can 
probably provide the core for many visualization needs, particularly those products which are 
more extensible. However, if NASA intends to rely significantly on COTS software, then 
consideration should be given to changing the level or interaction with vendors, In particular, 
NASA should encourage more communication between scientists and the developers of COTS 
software. 


In addition, there arc many significant software contributions available within the public 
domain. Although a few of these, such as Khoros and MATLAB. arc large, refined 
development efforts, most arc primarily small-scale efforts which provide useful utilities or 
limited applications. Since the source code is generally available, public domain software is a 
good source for useful utilities or usable algorithms. Since maintenance and support with 
these packages is often a challenge for the developers, particularly for larger- scale efforts. 
NASA should consider mwnsof providing assistance for packages which it feels provide 
significant benefits to EOS scientists. 

While COTS software may play a significant role in meeting EOS visualization needs, relying 
solely on development from the general software industry may not be adequate. Particularly 
without guidelines from NASA, and changes to the way that NASA deals with vendors, it is 
risky to assume that the outside industry will take on the challenge to develop visualization 
software capable oT meeting the needs or the EOS program This is particularly true for 
application-specific needs which may not provide an adequate commercial market. 

A balanced approach between COTS and in-house development is probably the best solution for 
meeting the visualization requirements oT the EOS mission. The needs for in-house 
visualization development within NASA include: 

• Extension or modification of existing software to meet application -specific 
requirements 

• Contributions oT useful modules or components to the general programming pool 

• Leading edge (experimental) development of techniques and applications 

• App I ica l ion -specific programs not available from COTS and not able to be created by 
simple extension of existing programs 

These needs should be recognized and programs should be structured to adequately provide 
support for meeting these needs. 


CURRENT DEVELOPMENT ENVIRONMENT WITHIN NASA 

Development activities throughout NASA have resulted in some significant, leading-edge 
visualization and analysis tools. Both advanced and experimental visualization development 
has transpired within the Office of Aeronautics and Space Technology (OAST) to support 
flight operations, aeronautical development, computational fluid dynamics (CFD) research, and 
virtual reality. Although not presently well suited for meeting visualization needs oTEOS. 
some of these efforts could be modified to meet some of these requirements. 

Within the Office of Space Science and Application (OSSA), support Tor visualization 
development has been modest, particularly within the Earth Science and Application Division 
(ESAD) which supports the EOS project. Still, development activities within the Informations 
Systems Branch (Code SMI) and the various science branches of the ESAD and the Solar 
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System Exploration Division have resulted in successful tools, capable of meeting a limited 
part of the EOS requirements. 

Unfortunately, without proper direction and appropriate support, these efforts will be 
inadequate for meeting the visualization requirements within EOS. At present, visualization 
development activities within OSSA and the ESAD can be characterized as: 

• Fragmented, with little overall direction or coordination 

• Lacking an established and understandable organizational structure for funding both 
general tool and application -specific development 

• Lacking a firm commitment or plan for meeting the true visualization needs of the 
EOS science community 

• Lacking adequate mechanisms for technology transfer both within OSSA and between 
OSSA and OAST 

• Experiencing inefficient use of limited funding 

This environment is resulting in inefficient development which does not adequately address the 
full needs or the scientist. In particular, there is much redundant programming and little 
effort directed toward creating tools that can be easily extended and customized. Because of 
archaic programming practices, functionality developed within one program is not easily 
transferred into other development efforts. Attempts to fulfill application-specific 
requirements often result in large-scale application development rather than the smaller-scale 
addition of modules to existing applications. Finally, the absence of support fot small-scale 
application development within science proposals generally results in the use of inadequate 
tools to meet science objectives, or the scavenging of science funds in order to meet 
development requirements. 

Required in-house development is of two types: (I) development of general tools to meet the 
needs of a wide range of users not being met by existing software, and (2) extension, 
modification, or integration of existing tools to meet application-specific requirements. 

Unfortunately, the lack of extensible software in the past, and the lack of coordinated 
development activities at NASA, has not allowed a clear distinction between these efforts. 

Distinguishing between these efforts, and providing appropriate support for both, is vital to 
meeting present and future visualization requirements in a cost-effective manner. 

While development of general visualization tools should be intimately related to the scientific 
objectives of the EOS program, the current philosophy that most visualization development 
should be "piggy-backed" with science proposals has been ineffective. The reasons for this are: 

• Scientists are reluctant to include software development cost in science proposals Tor 
fear that the proposal costs will not be competitive: 
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• If only 80% of the contract is Rinded, software development is usually the first to be 
eut; the scientist will then fall back to available tools that arc probably inadequate for 
his/her needs; 

• Tying development intimately to a particular science proposal generally results in a 
very specialized and hard-wired tool, and docs not encourage general, reusable tools 
for meeting other science requirements: 

• Strictly creating visualization tools required for a specific science proposal docs not 
address nor encourage the exploratory development of new techniques required to 
advance visualization to a state required to meet the EOS objectives. 

It is recommended that general tool development be separated from specific science proposals, 
but be conducted by teams consisting of both computer specialists and scientists. In contrast, 
application-specific development could be directly tied to particular science objectives, but 
should receive, at least, minimal funding earmarked for that effort. 

DIAMONDS AND DINOSAURS 

A difficult challenge Tor NASA will be that or recognizing and supporting in-house 
"diamonds’ (experimental development efforts which hold much promise for scientists) and of 
preventing, recognizing, and dealing with "dinosaurs’ (old development programs with limited 
momentum and decreasing application). Diamonds typically begin as simple experimental 
ideas that, through proper development and strong acceptance by scientists, develop into 
applications with significant potential. However, without proper nurturing, such programs 
quickly falter due to stagnation, or more commonly, as a result of becoming overwhelmed with 
the responsibilities associated with distributing and supporting a successful application 
program. Without a proper mechanism in place to recognize these diamonds, and to assist in 
the transition of these development efforts, continued development and upgrading of these 
tools is sacrificed Tor the sake of maintena nce and support, and the science community losses a 
potentially valuable tool. 

At the other extreme arc the large programs with a long history of development and maturity 
which in many cases otter significantly more functionality than other existing programs, but 
which have become quite archaic with regard to their interfaces, program structure, data 
formats, graphics capabilities, and portability to modern computing platforms. This is 
primarily true with regard to image processing applications. Total reprogramming of this 
functionality using a more efficient, extensible, and portable approach, is an ambitious effort 
that has traditionally been avoided. However, simple porting to newer architectures is 
Inadequate. NASA must evaluate the importance of the functionality that exists within these 
programs and determine if and how it intends to incorporate this functionality into future 
applications. 

Preventing, or at least slowing, the creation of future dinosaurs is also an important issue. In 
fact, the issues involving diamonds and dinosaurs are not unrelated. The main factors 

V , / 


-V 

affecting the creation or dinosaurs from diamonds arc initial design, project transitions, 
upgrading, and perhaps, inevitability. Certainly, programs that are less flexible to extension 
and modification, or are too intimately tied to data structures, have a greater danger or rapidly 
becoming future dinosaurs. Again, the principles of abstraction and modularity inherent in 
object oriented programming could greatly increase the useful life of programs which adhere to 
these principles. Secondly, it is important that NASA properly assist the transition of the 
program through its various stages, while providing adequate funding for both user support 
and program upgrading. Still, while these measures may significantly extend the useful life of 
future development efforts, it is probably inevitable that any application will, at some point in 
time, reach a point of questionable usefulness NASA should also have a proper mechanism in 
place for phasing out continued support, while minimizing inconveniences to on-going users. 


MAIN AREAS FOR CONSIDERATION 

Based on these findings, the following general areas have been recognized as requiring 
consideration: 

• Integration of Visualization With Data Management and Analysis 

• Application Development for the Scientist as End-User 

• Providing Extensibility Without Complexity 

• Application-Specific Programming Without ReduDdant Programming 

• Simplification of Complex Heterogeneous Environments 
■ Development of New Techniques and Components 

• Education and Communication 

• Distribution nsd Maintenance 

These areas of concern will be discussed briefly below. More specific deficiencies and 
requirements will be discussed in a follow-up report. Recognition of these areas of required 
concentration docs not, by default, imply that NASA will need to take full responsibility for 
meeting these needs. In fact, meeting these needs will require a proper balance of efforts and 
adequate communication between NASA, vendors, contractors, and other outside developers. 

Integration of Visualization With Data Management and Analysis 

As illustrated in Figure 2. data management, data analysis, and data visualization should be 
viewed as a triad of functions which can act on scientific data. In order for this triad to 
operate effectively, there must be adequate data paths between all components and they must be 
bidirectional. At present, some tentative links exist within this triad, while others are 
nonexistent. In particular, the visualization function presently serves primarily as a backend 
process, with little return interaction with analytical tools or database management. 

In particular, the links between visualization and analysis are critical, but are, at present, 
inadequate. The application of visualization techniques can provide significant insight into 
scientific problems, but it is vital to the scientist to test and demonstrate this insight using 
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analytical methods. Until the scientist can easily and interactively 40 back and forth between 
the visual and analytical domains, visualization will not be considered a true scientific tool by 
the majority of scientists. 

Although data sets derived from analytical or visualization tools my be fed back into the 
database, the data management system still essentially acts as a frontend entity which provides 
data to be used by visualization and analysis Lools, but otherwise interacts very little with these 
functions. Particularly with the enormous database associated with EOSDIS and the increased 
size of many data sets, it is becoming more important for visualization and analysis tools to be 
able to query the data management system regarding content of the data. As an example, a 
visualization tool should be able to request of the data management system, 'Show me browse 
images of other data sets for the area in this bounding box, where the ratio of this parameter to 
this parameter is greater than this value.” Similarly, numerical models should be able to 
request additional data from the database based on immediate results obtained during the 
model runs. 

These examples imply more sophisticated data management capabilities and database than 
those required for simple inquiries based on sensor type, date or acquisition, and nadir 
location. They also suggest a different level of interaction between the client (in this case, 
another software application) and the data management system, than might be the case if the 
client were an actual user. 

Finally, navigation, or knowing the spatial and temporal location of each data point, is a 
critical issue that spans all components or this triad. The increasing requirement to combine 
data sets from a variety of sensors demands increased flexibility for navigation of the dau 
within all three components. Dau Management requires the ability to subset data based on 
precise spatial and temporal parameters, visualization requires the display of several data sets 
within a variety of projections, while analysis requires access to low-level daLa, as well as the 
ability to grid data into common grids. The present tendency to create higher level data sets, 
consisting of regularly gridded data in standard projections, only partly alleviates this 
challenge. Because each data set may have different scientific or instrument requirements, 
consensus with regard to standard grids and projections is difficult and ofren results in 
corruption and decreased scientific validity of the dau. Regridding previously gridded dau 
results in dramatically increased errors. It is therefore best to delay gridding and projection til 
as late in the visualization and analysis process as possible. This requires the availability or 
smart navigation tools, as well as the accessibility to low-level navigation information within 
the dau sets. 


Application BmJtoMngnLfttt Ifrc Scientist as End-User 

As discussed previously, the most immediate need for visualization is that of getting useful and 
usable interactive tools into the hands of the scientist. This requires that the tool meet the 
actual needs of the scientist, and not force the scientist to adapt to tools that are inappropriate 
for his/her needs. This also requires the operation of the tool, as well as data input and output 
of results, be intuitive and simple to the scientist. Interfaces should be simple and should 
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speak the scientist’s language where possible. The operation of the tool should be intuitive and 
logical to the scientist, rather than requiring the scientist to adapt to the logic of a computer 
graphics expert. In essence, the scientist should be able to focus on the analytical and visual 
results, and not on the operation of the tool. 

Few, if any, of the available visualization tools meet these needs, at present. Those which are 
simple to use 3re often inflexible and don't therefore meet required scientific objectives. Those 
which are extensible arc often too complex to operate and don’t, at present, allow adequate 
means of simplifying this complexity. 


Providing fjslcnsibilitv Without Complexity 

Many COTS visualization programs have not been well accepted into the EOS scientific 
community because of the lack of extensibility and flexibility. Without the ability for ihe 
scientist or computer specialist to input data in new formats, or to extend the functionality or 
the program with user-defined modules, the scientist is ofren forced to (I) adapt his/her data 
and needs to fit into visualization tools which arc inappropriate for analyzing the data, 
generally with poor results, (2) apply several independent visualization tools to the data in 
order to analyze different aspects of the data, (3) give up using present visualization tools in 
lieu of more traditional, but inadequate data analysis tools, or (4) initiate an in-house 
development effort which is costly in time and funding, with probably 50 to 75% of the effort 
applied to redundant programming of available capabilities. 

One of the main reasons quoted by scientists for initiating an in-house development ellori has 
been the need Tor extensibility and flexibility. This has also been a major reason for the 
success of many NASA-developed and other public-domain application programs, since source 
code is generally provided for these programs. Furthermore, the COTS visualization programs 
that have gained the most acceptance within the EOS science community are those which 
provide a high degree of extensibility and flexibility, including AVS. SCI's Explorer, NCAR 
graphics, PV-Wavc, 3nd IDL. The development of these tools, as well as public domain tools 
such as Khoros, are major milestones in the advancement of scientific visualization. 

However, the generality and extensibility of these tools have been generally accompanied by a 
high degree of complexity of use. PV-Wavc. IDL, and NCAR graphics require custom 
programming and command-line interfaces, while the graphical programming paradigms of 
AVS and SCI Explorer have given rise to the scl describing term, 'button hell”. These 
complexities, at present, limit the use of these programs to those scientists who have the time 
and initiative to become highly involved in the design or their tools In order for these tools to 
be applicable by a high percentage of EOS scientists, it is necessary that computer specialists 
be able to adapt these tools to meet the particular needs of a scientist or project while hiding 
the complexities of the extensible environment from the scientist using a simple and intuitive 
user interface, 

PV-Wavc and IDL have made great strides in allowing this, although these programs are 
primarily limited to interacting in 2D space. The developers of AVS and SCI Explorer are 

V / 


likewise aware of the issue, but must work out concerns regarding adaptive scripting, decision 
nodes, and user-defined GUIs before these programs will meet the needs of a large percentage 
of EOS scientists. Of course, any in-house or other NASA-funded development should also 
concerned with extensibility while allowing for simplification and customization, 


e Programming Without Re dundant Programming 

Redundant in-house programming within the Earth systems science community is extensive. 
The primary reason given for abandoning OTS software in favor of in-house develop is because 
available OTS software (a) did not do what was required for specific applications and lacked 
extensibility and adaptability to modify it accordingly, (b) had 3 user interface that was 
inappropriate or too cumbersome, (c) was too expensive, or (d) was inefficient in iu use of 
computer resources, Although most of these in-house application programs provide important 
features not available in other packages, each of these development efforts probably contain 
50-75% redundantly programmed code. In other words, in order to provide the new 
application -specific functionality, 11 was necessary to also redundantly develop functionary 
that was already available in several other programs. Cannibalism of previously developed 
code has been helpful in some circumstances, but most or this code is too intimately tied to the 
environment or the original program, or limits the flexibility or functionality of the new 
program. Considering that most of these in-house development teams consist of 1-3 
programmers, a considerable amount of effort is concentrated in programming functionality 
that already exist. 

As discussed above, the availability of more extensible and flexible visualization tools in the 
future may minimize, to some extent, the amount of in-house programming, and thus the 
amount of redundant programming. The extent to whiuh these and other OTS programming 
environment will meet our future development needs is still uncertain. However, there will 
probably still be requirements in the future for in-housc application -specific programming. 

The remaining problem then becomes how 10 reduce the amount of redundant programming, 
while meeting the requirement for application-specific programming within the EOS 
community. The answer probably lies in 3 combination or hierarchical and object-oriented 
development. 

The benefits of object-oriented programming Include the reusability of modular components, 
and the minimization, through abstraction, of complexities resulting from our heterogeneous 
computing environment. Hierarchical development implies that general components, which 
provide functionality common in many programs, be developed and made available to 
application-specific development efforts, thereby allowing these efforts to concentrate on 
providing value-added programming rather than redundant programming 


Simplification of Complex Heterogeneous E nvironmrnts 


Scientists and computer specialist are continuously required to operate in a complex 
heterogeneous computing environment. Data and applications exist on several computers 


running DOS, OS/2, Mac. Unix. VMS. or MVS operating systems, as well as having various 
graphics protocols and functionality. Data files reside on an array of storage devices in a wide 
variety of standard and proprietary data formats. Networks operate under a mix of protocols, 
with each protocol or hardware platform having its own peculiarities with regard to file 
transfer . Access to hardcopy or video devices is through serial, parallel, ethcrnct, or SCSI 
ports, with each port and each device requiring different protocols or file formats. The need to 
operate within this complex environment accounts for a significant loss or efficiency in the use 
of our computing tools. Several scientists have estimated that, at least. 60-70% of their 
computing efforts are consumed in dealing with the complexities of networks, dau formats, 
and operating systems, leaving very tittle time and energy for concentrating on scientific 
analysis. 

There are two potential solutions: homogenize the computing environment or make working 
within the heterogeneous environment simple and transparent. The latter is more realistic for 
both the present and the future While it is important to develop and adhere to common data 
formats, as well as standards in operating systems, network protocols, windowing 
environments, and graphics languages, it unreasonable to believe that our computing 
environment could ever become, and remain, homogeneous. Therefore, in order to make 
efficient use of our computing resources, it is vital that operating within this complex 
environment become simple and less painful. 

In essence, a scientist shouid be able to pull an icon representing data over to an application 
icon, and have the program operate regardless of underlying complexities. The scientist sitting 
on a Unix graphics workstation, for example, should not be required to deal with underlying 
complexities resulting from the possibility that the data resides on a different computer, in a 
formal incompatible with the application program, which possibly resides on a VMS-based 
computer. Furthermore, the scientist should be able to request a color hardcopy or video 
animation, with little concern as to the underlying complexities involved in accomplishing that 
request. 

There is no technical reason why such a solution cannot exist today. In fact, present efforts to 
provide Unix and DOS with a more Mac-like interface, as well as efforts directed toward object 
oriented operating systems, arc all major advances toward this goal. Still much emphasis and 
effort is required, in order to move the scientific computing community into such an 
environment. 


Developm ent of New Techniques and Components 

Although many of our immediate needs involve integrating present capabilities into useful and 
usable application for scientists, there is still a need to continue investigation and development 
of new techniques and components. In 3 few cases, this may involve a new approach to 
analyzing and visualizing data and require development of a complete application program. 

The NASA funded programs. Linkwinds. Interactive Image Spreadsheet (I1SS), and VisAD. arc 
examples of such. 
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However, in most eases, the development of new techniques should not require development or 
a complete application in order to be applied. Instead, newer techniques, such as spectral 
mixing, volume rendering algorithms, or accurate navigation routines, for example, should 
instead be developed as functional components or modules which can be added to existing 
applications. Unfortunately, the lack of module programming and coordination of development 
in the past, has generally resulted in the development a new application around each new 
technique. Adherence to object-oriented programming principles and the use of extensible 
software would maximize our future development efforts and minimize the cost ofleading edge 
development. 

Education and fomnmnicaiinn 

Many of the present challenges in placing appropriate visualization tools into the scientist's 
hands are the result of a simple lack of communication and education. Many scientists lack an 
understanding and appreciation of the importance of visualization in meeting their scientific 
objectives, while others lack an understanding of (he present limitations. Project managers 
aren't certain whether visualization needs exists and if these needs should be met with COTS 
software or in-house development. Many developers lack a true understanding of the needs of 
the scientists and arc often frustrated when scientists don’t appreciate the tools presented to 
them. In particular, hardware and software vendors must rely on their own restricted 
capabilities Tor determining the present and future needs of EOS scientists, generally obtaining 
a narrow view based on contact with a few select individuals. Even within NASA, itself, 
scientists, developers, and project managers are often unaware that efforts going on at other 
centers or within other organizations, might be beneficial and applicable to their own 
activities. 

Furthermore, scientists and specialists, alike, arc often unaware of the existence and 
functionality of many of the available tools that might be useful to them. It is a very time- 
consuming process to search out a specific tool that might serve one's need, to download or 
order it, install it on the hardware platform at hand, learn to operate it, get appropriate data 
into it, and then to test it for applicability to the scientific problem, at hand. If that tool does 
not meet the total needs, or in some eases does not meet the needs at all, then the search and 
testing process must begin again or be abandoned. Scientists and visualization specialists 
would benefit greatly from a system that not only informed them of the availability of 
appropriate visualization tools, but one that also provided them unbiased opinion and expert 
technical assistance in evaluating whether a given tool would meet their objectives. More than 
simply a repository of programs and data, scientists need knowledgeable assistance. 

Communication must improve both within NASA and between NASA and its contractors and 
vendors. Often, a lack of communication is responsible for NASA unnecessarily "contracting 
out for expertise", rather than using expertise that already exist within other organizations of 
NASA. EOS scientists must be made aware of tools that arc available to them, while 
developers must become better informed of the immediate and future needs of the scientists. 

Particularly if COTS software is to play a major role in meeting future needs, then vendors 
must become more of a true partner in the process. 
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The distribution and maintenance of application software is a significant challenge, 
particularly with regard to NASA funded or public domain software. As an in-house 
development project begins to reach a certain level or success, it is often faced with the 
dilemma of limiting the distribution and support or the software, or redirecting funding and 
manpower from development to support. Without the proper funding and mechanism for 
assisting in the transition from experimental to operational, distribution and user support can 
become a serious drain on the project effort. This results in either poor user support or in 
stagnation of development and growth of the program. Under such conditions, younger 
programs often never reach a useful level of maturity, while older efforts become archaic do to 
lack of program modification and upgrading. 

In essence, there is a need Tor assistance in transitioning experimental software Imo 
operational phases, primarily for distribution and user-support efforts. It is important that 
distribution and support efforts be established well before a program becomes completely 
polished and documented. In addition, there should be a mechanism for distributing 
unsupported or modular code, Since these 3rc often of significant use 10 in-house developers. 

In addition, there arc many public domain applications which can be of significant benefit to 
the EOS scientific community. If these efforts do not continue to receive adequate funding for 
product distribution, user support, and continued development, these projects arc forced to 
either dissolve or go commercial, often with disastrous results Tor the user and development 
alike. For applications found useful for meeting NASA's scientific objectives, it is generally 
more cost -effective to provide these projects some level of support through licensing or 
funding, than to replace the functionality of these program with in-house development. 

For COTS software, the major challenges lie primarily in the ability to test these programs 
before buying, and in establishing proper mechanisms for licensing useful software within a 
distributed environment. Improved site licensing mechanisms, such as network licenses, 
floating licenses and ’pay-as-you-osc’ licenses, would be useful. In addition, rapid 
procurement of useful, tested COTS software is essential. 

CONCLUSIONS 

Visualization is vital for meeting the scientific objectives of the EOS mission. Although our 
present suite of visualization techniques is impressive and powerful, the application software 
for putting these capabilities into the hands or the scientist is, at present, inadequate. The 
development of extensible, user-friendly, object-oriented software within the commercial 
software industry, is helping to increase the probability that COTS software can serve as a core 
for meeting many visualization needs. However, there will still be a need Tor in-house efforts 
directed at extending and modifying these tools to meet application -specific requirements, and 
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for developing leading edge techniques and applications not available from the commercial 
front. 

A follow-up report will investigate viable options for meeting the visualization requirements of 
the EOS mission in the future. These options will focus on two major objectives: increasing 
the effective use of visualization tools by scientists, and maximizing the return on development 
efforts. 
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AISRP PROJECTS 


PARALLEL COMPRESSION OF SPACE 
AND EARTH DATA 

John Relf, Hlllel Gazll, Tassos Markas 
(Research Assfstanl) - Duke University 




AA AA’ 


a'A A 


™:^ r9 \K r *' enWl0n ° f i h * MuHI-ReioluHon Vector Guantlzallon 
algorithm which compresses data by dividing an Imagfe block Into a 
number of variable-size subblocks, and encoding them separately 
using a quad-tree representation. 


PROJECT 

STARTING DATE 

SPACE TELESCOPE SCIENCE INSTITUTE 

July 1, 1992 

UNIVERSITY OF ILLINOIS 
TEXAS A&M 

June 4, 1992 
June 4, 1992 

UNIVERSITY OF MARYLAND 

June 1, 1992 

HUGHES AIRCRAFT COMPANY 

June 27, 1992 

UNIVERSITY OF ILLINOIS 

July 20, 1992 

UNIVERSITY OF COLORADO, BOULDER 

August 1, 1992 

UNIVERSITY OF WISCONSIN, MADISON 

July 1, 1992 

UNIVERSITY OF MARYLAND 

June 1, 1992 

MASSACHUSETTS INSTITUTE OF 
TECHNOLOGY 

June 1, 1992 

BRANDEIS UNIVERSITY 

June 1, 1992 

SPACE APPLICATIONS INTERNATIONAL 
CORPORATION (SAIC) 

July 1, 1992 

BOSTON UNIVERSITY 

July 1, 1992 

UNIVERSITY OF COLORADO, BOULDER 

June 1, 1992 

NATIONAL CENTER FOR ATMOSPHERIC 
RESEARCH 

July 9, 1992 


HIGH PERFORMANCE COMPUTING 
AND COMMUNICATION 


ADDITIONAL RESEARCH TASKS 


PROJECT OFFICE 

• To provide NASA Headquarters HPCC 
Project Office advice and support. 


Project Coordination 


Technical Advice/ Consultation 


George Washington University: 
9/90 - S/91 


George Washington University: 
10/90-9/92 


John Sibert and Cindy Starr 
Computer Graphics for Scientific 
Visualization 

Dan Spicer 

Rainald Lohner 

Computer Codes for Simulation 
of 3-D Compressible Magnet o 
Hydrodynamic Flows 


George Washington University: 
11/90-9/91 


Administrative Assistance 


Dan Spicer 

Burt Edelson and Hermann Helgert 
Supercomputer Networking for Space 
Science Applications 


Library Organization 


Personnel: 

Dr. Thomas Sterling 
Mr. Michael MacDonald 
Ms. Kimberly Dunn 


Penn State University: 
5/90 - 5/92 


Stanford University: 
3/91 - 3/92 


Eric Feigelson 

Research Assistant: Michael LaValley 

improved Data Analysis In 
Astrophysics 

Philip Scherrer and Richard Bogart 
AstroMail Development; an Electronic 
Mail System for the Astrophysics 
Community 


Brown University: 


Jeffrey Vitter and Paul Howard 
Data Compression Algorithms 


E 153 






CONSULTANTS 


FELLOWSHIPS 


Noah Friedland, University of Maryland 
Solving Inversion Problems in Atmospheric 
Sounding 

Working with Tony Guaftieri, Milt Halem, and 
George Serafino at NASA Goddard 


Dmitry A. Novik 

Data Compression, Image Analysis and Channel 
Efficient Coding 

Working with Jim Tilton & Manohar Mareboyana 
at NASA Goddard 


Elaine Roger, Johns Hopkins University 
Using Neural Networks for Machine Labeling of 
Satellite Data 

Working with Richard Kiang at NASA Goddard 


Leonard Dickens, University of Maryland 
Development of the SPUDS and IIFS System of 
the Intelligent Data Management Project 

Working with Nick Short at NASA Goddard 


FUNDED BY CRAY RESEARCH INC. 

• 1991/92 Douglas Smith, Carnegie Mellon 
University 
(H.T. Kung, advisor) 

Intermediate Language and Virtual 
Architectures for High Performance 
Image Processing 


1992/93 Kathleen Perez-Lopez, George Mason 
University 

(Arun Sood, advisor) 

Use of an Index/Browse Set of Images for 
Database Management 


WORKSHOPS 


OTHER TASKS AND ACTIVITIES 


1989 Computing Challenges in Managing Future 
Massive Image Systems 

Attended by 69 scientists and graduate 
students from 22 NASA, university, and 
private sector laboratories. 


1 990 The Role of Computer Science Research in 
the Mission to Planet Earth 

Attended by 238 scientists, graduate 
students, and business people. 


1 991 Workshop on Parallel Algorithms (WOP A ) 

Provided monetary support for workshop 
held May 9 * 10 in New Orleans. 


1992 Scientific Data Management 

To be held September 25. Expect 40-50 
participants. 


Technical Report Series 

* 86 titles currently. 

* Have filled requests for 1 200+ copies thru July 31 , 
1992. 

Seminar Series 

Have organized and presented Advances in 
Computational Sciences Seminar Series, a joint 
CESDIS/ Space Data and Computing Division series, 
since September 1989. 

Data Compression Conference 

* DCC ’91 April 7 - 1 1 , 1 991 , Snowbird, Utah 

Provided publicity, registration, and 
on-site conference support. 

* DCC ’92 March 24 -26, 1992, Snowbird, Utah 

Provided publicity, registration, and 
on-site conference support. 

Peer Review Support for NASA New Research 
Announcements 

* 1990 Applied Information Systems Research 

* 1992 Research in High Performance Computing 
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^^OTHER TASKS AND ACTIVITIES 




^ NEW CESDIS DIRECTIONS 



• Provide support to Minority University Space 
Interdisciplinary Network (MU-SPIN) project 




• Hire full-time CESDIS Associate Director. 



* Provide half-time administrative support for 
Program Coordinator. 




• Hire full-time CESDIS Staff Scientist. 



* Supported annual Working Group Conference in 
September 1991. 




• More coordination of AISRP Projects. 



* Will support Working Group Conference in 
September 1992. 




. Issue HPCC - CESDIS Call for Proposals. 



• Have provided access to work space and Sun 
computers for varying periods of time for 16 
visitors to NASA Goddard since September 1989. 




• Assist Earth and Space Science HPCC 
projects. 







- Build ties with ESDIS development 
(formerly EOSDIS). 
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Visualization is a vital link required to meet 
the scientific objectives of EOS 

Although there is a lot of power in our 
present techniques, the tools are not yet 
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Visualization Hardware Platforms 



specific application add-on modules general application quick & dirty fixes 










OSSA LIFE SCIENCES DIVISION RESEARCH 

REQUIREMENTS FOR 0SSA life sciences division 

SPACE STATION FREEDOM research requirements 



E 171 







E 172 

















E 175 











E 177 




JPL TRADITIONAL TECHNOLOGY TRANSFER JPL IMPLICATIONS OF TECHNOLOGY MATURITY 
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